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Motivated by the need to correct the potentially large kinematic errors in approximations used 
in the standard formulation of perturbative QCD, we reformulate deeply inelastic lepton- proton 
scattering in terms of gauge invariant, universal parton correlation functions which depend on all 
components of parton four-momentum. Currently, different hard QCD processes are described by 
very different perturbative formalisms, each relying on its own set of kinematical approximations. In 
this paper we show how to set up formalism that avoids approximations on final-state momenta, and 
thus has a very general domain of applicability. The use of exact kinematics introduces a number 
of significant conceptual shifts already at leading order, and tightly constrains the formalism. We 
show how to define parton correlation functions that generalize the concepts of parton density, 
fragmentation function, and soft factor. After setting up a general subtraction formalism, we obtain 
a factorization theorem. To avoid complications with Ward identities the full derivation is restricted 
to Abelian gauge theories; even so the resulting structure is highly suggestive of a similar treatment 
for non-Abelian gauge theories. 
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I. INTRODUCTION 

The standard leading twist formalism for calculating 
deeply inelastic lepton-proton scattering (DIS) cross sec- 
tions provides a foundation for understanding parton dis- 
tribution functions (PDFs) and perturbative Quantum 
Chromodynamics (pQCD) in general. At issue in the 
present paper are the various approximations to par- 
ton kinematics that are fundamental to the standard 
approach, as we will review in Sec. |TTJ These approxi- 
mations are appropriate for very inclusive cross sections, 
and they result in a number of appealing conceptual and 
practical simplifications. The resulting factorization the- 
orems involve standard (fully integrated) PDFs that have 
rigorous definitions and depend only on the longitudinal 
component of the parton momentum [l], Q ■ 

But when the true final states are studied in more 
detail, problems arise. Although various resummation 
methods are used to overcome the problems, it has be- 
come increasing ly clear jHHH that it is the standard 
kinematic approximations that should be questioned. 
The problems arise because the approximations change 
momenta of particles in the final state, typically resulting 
in a final state that does not obey 4-momentum conser- 
vation. For a fully inclusive cross section, this is not a 
critical issue. Furthermore, if one is only concerned with 
obtaining a leading-logarithm approximation (improved 
only by the use of an appropriate running coupling), it is 
legitimate to be imprecise about the details of the kine- 
matic approximations. 

But to be able to make systematic improvements by 
including higher order corrections to the hard scattering, 
and to the showering and evolution kernels, etc, a precise 
formulation of the approximations is needed. Of course, 



one must expect approximations to be needed if one is to 
obtain tractable factorization results. But for the formal- 
ism to be generally applicable, it is necessary to define 
the approximations in such a way that final-state mo- 
menta are unaltered. Attempts to remedy the situation 
are now stymied by the use of conventional "integrated 
parton densities"; there is a mismatch [j| 0, H, [y| be- 
tween the definitions of integrated parton densities and 
the imperatives of factorization with correct final-state 
kinematics. The mismatch extends equally to fragmen- 
tation functions. 

As explained in 0, 0, H, @ , when we use methods that 
treat final-state kinematics exactly, we are led to the re- 
placement of conventional parton densities by more gen- 
eral quantities. The range of methods, from standard 
ones with approximated final-state kinematics to the im- 
proved ones that are the subject of this paper, can be 
characterized by the kind of parton densities, or general- 
ization, that are used: 

• Conventional integrated parton densities: 

These are the usual PDFs; they depend only on 
a longitudinal momentum fraction variable, x, and 
the hard scale, Q 2 . All other components of parton 
momentum are integrated over. Correspondingly, 
the intrinsic external parton transverse momentum 
kt and virtuality are neglected in the hard scat- 
tering. These parton densities have consistent op- 
erator definitions [2] used in the classic proofs of 
factorization. 

• Unintegrated parton densities: These are the 
fc t -dependcnt PDFs obtained when the integral over 
parton transverse momentum is left undone, while 
the minus component (or virtuality) is still in- 
tegrated over. The concept of an unintegrated 
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parton density appeared quite early Q, neces- 
sary for the treatment of the transverse momen- 
tum distribution in the Drell-Yan process. These 
quantities are also called transverse-momentum- 
depcndcnt (TMD) parton densities. There are also 
TMD fragmentation functions. 

Unintegrated parton distribution functions also ap- 
pear naturally in the high energy limit of QCD [8] 
where one of the key features is the lack of the 
transverse momentum ordering of subsequent gluon 
emissions. Small- a; resummation then provides an 
evolution equation in a rapidity variable for this un- 
integrated gluon distribution function. In this ap- 
proach the integrated gluon density is defined sim- 
ply as an integral over the transverse momentum up 
to a hard scale. However, the definitions of TMD 
densities that we refer to in this work may not, in 
general, agree with other definitions appearing in 
the literature. 

As for a definition, the obvious and natural one 
is given by the hadron expectation value of the 
parton number operator in light-front quantization, 
equivalent to a standard simple expression as an ex- 
pectation value of a bilocal field operator, in light- 
front gauge. However, this definition suffers 0, 
from what we will call rapidity divergences, from 
where the rapidity of internal gluons goes to infin- 
ity. These divergences occur even when ultraviolet 
and infra-red divergences are cut off, as reviewed in 

So a correct definition requires an explicit cutoff 
for the rapidity divergences. Collins and Soper 
0, [Tl| used a non-light-like axial gauge for this 
purpose. Certain improvements are probably nec- 
essary, as we will see. Other work suffers from 
an imprecision in the definitions. For example, in 
[H, EH ! w e do not see the operator definitions at 
all, and the issues concerning rapidity divergences 
are completely hidden, probably in some applica- 
tion of a leading logarithm approximation, with 
cutoffs on both real and virtual lines in Feynman 
graphs. However, it is important to understand 
the differences in the definitions of TMDs within 
different theoretical treatments. For example, in a 
recent analysis of dijet correlations at the Relativis- 
tic Heavy Ion Collider (RHIC) [3, it is shown that 
there is a large variation between the predictions 
obtained using different TMDs from the literature. 
Therefore it is necessary to construct a precise and 
unique operator definition of the unintegrated par- 
ton distribution functions. Then, hopefully, the 
different cases discussed above (for example those 
which emerge in the context of BFKL 8] or angu- 
lar ordered [13 , fl5j evolutions) will follow naturally 
as particular limits of the general definition, or as 
changes of factorization scheme. 

For the concept of a parton density to be applicable 



in the real world of hadron physics, it needs to in- 
clude all the relevant non-perturbative phenomena 
concerning the actual state of the parent hadron. In 
particular, the need for TMD densities arises when 
we treat final states in enough detail to be sensi- 
tive to the transverse momenta of partons relative 
to their parent hadrons, etc. More recent work de- 
voted to providing precise definitions to the TMD 
densities was done in Ref. [lj|. 

• Parton Correlation Functions: An even closer 
examination of final-state kinematics, as was done 
for small- a; physics by Watt, Martin, and Ryskin 
H, 0], indicates that "doubly unintegrated" PDFs 
(differential in all components of parton momen- 
tum) are more appropriate in many situations. 
These can also be called "fully unintegrated par- 
ton densities" , but in this paper we will simply 
call them "parton correlation functions" (PCFs). 
Collins and Jung [f| showed very generally that for 
differential distributions in final-states the use of 
parton correlation functions is necessary. Collins 
and Zu Q set up a complete formalism in a model 
field theory, suitable for Monte-Carlo event gener- 
ator implementation, with the possibility of incor- 
porating arbitrarily non-leading-order corrections. 

Under the label of parton correlation function we 
will include both the fully unintegrated general- 
ization of parton densities, and related objects for 
fragmentation, and for soft factors in factorization 
theorems. 

The need in the general case for making no approxima- 
tions on final-state kinematics impels us to a formalism 
that uses PCFs. Unfortunately, a complete treatment 
and derivation of factorization using PCFs does not yet 
exist for QCD. Therefore, our aim in this paper is to 
initiate the construction of such a formalism. The new 
formalism should handle non-leading-order corrections as 
generally as the standard formalism. Thus it will be as 
good, if not better, in cases where more traditional ap- 
proaches are applicable. But it will also apply to more 
general situations. 

To treat kinematics correctly, we need to go back to 
the foundations, and we will see that the basic struc- 
ture of the derivation in fact needs to be significantly 
modified even at the lowest- order, parton-model level for 
DIS. Normally, one starts with the handbag diagram of 
Fig. QJa), where the outgoing struck quark is exactly 
massless and on-shell. But even without any sophisti- 
cated treatment of QCD effects, we know that the quark 
must turn into a jet. So a minimum logical foundation 
starts from Fig.[TJb), where the outgoing quark line now 
has acquired a jet subgraph. In fact, examining the kine- 
matics of such graphs gives the motivation 0, 0, [E Hi f° r 
searching for methods that use PCFs. 

However, in real QCD, this is not sufficient. There 
must certainly be non-perturbative interactions at late 
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FIG. 1: (a) Parton model handbag diagram for DIS. (b) Handbag diagram with outgoing jet factor, (c) The general reduced 
diagram for full QCD with one jet. Note that in graphs like (c), the target collinear gluons may pass through the cut inside 
the target bubble. 



times to neutralize the color of the outgoing quark. Fur- 
thermore, even in perturbation theory, and even without 
going to the conventional domain for NLO corrections 
where extra high transverse momentum jets are produced, 
it is necessary to consider all regions of the form sym- 
bolized by the graphs of Fig. QJc). There are arbitrar- 
ily many collinear gluons exchanged between the jet and 
spectator subgraphs and the hard scattering, and there 
are arbitrarily many soft (interacting) gluons connect- 
ing the jet and spectator subgraphs. We do not have 
topological factorization [13] of the subgraphs. Only af- 
ter an application of Ward identities to give a coherent 
sum over different graphs does one obtain factorization 
for the cross section. A consequence is that, to obtain a 
correct factorization formula, one needs to make a careful 
selection of an appropriate gauge-invariant definition of 
the parton correlation functions. So the key issues center 
around what such a definition is and why. 

These issues are in fact present in conventional fac- 
torization, although most treatments do not emphasize 
them. The reasons why we now need to examine them in 
much closer and exact detail are coupled to our current 
aims of treating final states less inclusively. The Ward 
identities used to derive factorization apply only after 
the application of certain approximations. In the most 
obvious way of setting them up, the approximations are 



only valid in regions of appropriately strongly ordered 
kinematics. Moreover, there is a general tendency in 
constructing derivations to neglect power-suppressed cor- 
rections whenever it assists the ease of derivation. The 
Ward identity arguments are therefore typically applied 
to amplitudes where cutoffs are imposed on the momenta 
of internal lines of the graphs. But the Ward identities 
are only exact when applied to amplitudes in the exact 
theory, i.e., when no cutoffs whatever are applied to the 
internal momenta other than, possibly, a conventional 
gauge invariant cutoff for UV divergences. 

There is therefore a conflict between the cutoffs used in 
discussing why factorization occurs and the lack of cutoffs 
needed for the validity of the Ward identities. Once we go 
beyond the leading logarithm approximation we cannot 
restrict the kinematics to strongly ordered regions. A 
careful and precise formulation of approximations and 
Ward identities is needed, otherwise there are likely to 
be present uncontrolled correction terms that can readily 
violate factorization. A subtraction formalism, such as 
we will use, avoids the problems with direct cutoffs on 
the momenta of lines of Feynman graphs. Examination 
of the literature — e.g., [l7], (3 — shows that the Ward 
identities for proving factorization are typically stated 
and proved too inexplicitly for our purposes. 

We stress that the treatment in this paper is limited to 
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lowest order in the hard scattering, where the final states 
are rather simple. What we discuss in this paper may 
be thought of as a formal generalization of the parton 
model to cases where the details of over-all kinematics 
are important. This is a critical step toward a complete 
formalism because higher order calculations rely on the 
subtraction formalism to avoid double counting the ze- 
roth order contribution. Knowing exactly what is sub- 
tracted requires a complete understanding of the zeroth 
order contribution. 

If it turned out that factorization were universally true, 
these issues would not be pressing. However, recent work 
indicates otherwise. For example, it has been shown 
[H [H [H HI HI, rather unambiguously [H that fac- 
torization fails for production of hadrons of high trans- 
verse momentum in hadron-hadron collisions. This is 
in the case that the hadrons are close to back-to-back 
azimuthally, when unintegrated parton densities are an 
appropriate tool. See also the recent paper by Bauer and 
Tackmann [24| for a closely related argument. 

In other situations, model calculations can appear to 
show that factorization fails, but the culprit is an in- 
appropriate definition of parton densities. An example 
is the transverse single-spin-asymmetry (SSA) in semi- 
inclusive deep- inelastic scattering (SIDIS), which Brod- 
sky, Hwang and Schmidt (25[ found to be non-factorizing. 
Collins [26| showed that factorization does actually hold, 
but only if suitable Wilson line operators are used in the 
definition of the unintegrated parton densities. 

These motivations lead us to the detailed treatment 
in the present paper. The issue that makes the work of 
Collins and Zu [6| insufficient is the presence of extra 
gluonic connections between the subgraphs of Fig. QJc). 
Further difficulties are associated with the masslessness 
of the gluon in QCD, and with the complicated nature 
of Ward identities in a non-Abelian gauge theory. It is 
useful to solve the difficulties one-by-one. So the full 
technical results in this paper are restricted to the case 
of an Abelian gauge theory with a massive gluon. How- 
ever, much of our treatment applies more generally and 
includes QCD. Furthermore we will obtain a factoriza- 
tion stated in terms of PCFs with precise definitions as 
certain operator matrix elements. The statement of fac- 
torization is equally applicable to QCD. 

In Sec. |TT] we will deconstruct the standard parton 
model/handbag diagram in the context of pQCD. Then 
we will discuss in Sec. IIIII what requirements are needed 
for a formalism that gives a good treatment of parton 
kinematics. In Sec. IIV1 we discuss the projection oper- 
ators which enable us to make approximations suitable 
for obtaining the standard parton model. In Sec. |V] we 
list a set of candidate definitions for the PCFs, and dis- 
cuss the reasoning for our choices. Sec. I VII is devoted to 
a subtraction procedure and in Sec. IVHI we discuss the 
kinematic approximations and the rapidity differences. 
In Sees. I Villi and IIX1 we present calculations to sup- 
port the consistency of the structure outlined in Sec. IIIII 
Finally in Sec. [X] we present all-orders proofs of the the 



factorization formula with the PCFs, including the nec- 
essary Ward identities, but at this point restricted to an 
Abelian theory. 



II. THE BREAKDOWN OF STANDARD 
KINEMATIC AL APPROXIMATIONS 

A. The parton-model approximation 

In this section we carefully analyze the derivation of 
the parton-model approximation for DIS, paying special 
attention to the analysis of parton kinematics. 

Although Fig. [Ha), with its single on-shell final-state 
struck quark, is the usual starting point, we actually need 
to start with Fig. [ljb) , where the quark fragments into 
a group of final-state particles. We will show in what 
sense the sum over all graphs of the form of Fig. [ljb) is 
approximated by Fig. [2a), i.e., we will analyze how it 
is possible to neglect all higher-order corrections to the 
final-state bubble J(k + q). For this part of the discus- 
sion, we will restrict our attention to graphs without the 
extra gluonic attachments shown in Fig. QJc) . 

In Fig. [ljb), q is the incoming virtual photon mo- 
mentum and P is the momentum of the target (pro- 
ton). As usual, Q 2 — —q 2 is the photon virtuality and 
XBj = Q 2 /(2-P • q) is the Bjorken scaling variable. The 
incoming momenta may be expressed as 

(i) 

Here, to provide a simple formula we use the Nachtmann 
variable x instead of the Bjorken variable. Since we want 
to start with no approximation on kinematics, we note 
the exact relation between the two variables: 

X = — *m ( 2 ) 

1 + ^1 + 4^4, 

they are equal up to a power-suppressed correction, as 
Q — > oo. 

The contribution to the hadronic tensor from Fig.[TJb) 

is 

w^( q ,p) = e£/ -^nrj(k+qh v mp)}- 

j 

(3) 

Here, the sum is over quark flavors, and ej is the electric 
charge of quark j in units of the size of the electron's 
charge. We will leave implicit the dependence on j of the 
target and jet factors J and <F We will sometimes write 
the outgoing quark momentum as I = k + q. The kine- 
matic constraints, that the final states in J{k + q) and 
&{k, P) have positive energy and positive invariant mass, 
impose limits on the values of k where the integrand is 
non- vanishing. 
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To obtain the standard LO DIS expression, we apply a 
number of approximations valid at the leading power of 
Q. First, we expand both the upper bubble, J{k+q), and 
the lower bubble, &(k, P), in a basis of Dirac matrices: 

*(*, P) = $ s + 7 m $m + o*"*^ + 75$5 + 7 M 75$5 M , 
.7(0 = 7s + 7 M 7 M + + 75 Js + 7^75^5 • 

Then we observe that in the Breit frame, the trace in 
Eq. ([3]) is dominated by terms containing 7~ in $ and 
7 + in J , i.e., 7 _ $ + and 7 + j7"~, together with terms only 
relevant for polarized scattering. These terms dominate 
because the coefficients <£> + and J~ are the large ones 
after a boost from the rest frames of the final state of 
each of the bubbles. Therefore, up to power-suppressed 
corrections, 

Tv[ 1 ^J{k + q) 1 v ^{k,P)] 

~Tfc[yyhy* 7 -] j-(fc + g )$+(fc,P). (5) 

Next, we focus attention on values of the quark mo- 
mentum where parton model kinematics are good. First, 
for each parton line, we define corresponding massless 
momenta: 



fc= (x Bj P + ,0,O t ) 



0, 



2x B \P^ 



(6) 



LO calculations. The parton- model region is where the 
transverse and minus components of k are small rela- 
tive to Q, and where k and I both have small virtualities 
relative to Q 2 . Then we can treat k and I as good ap- 
proximations to k and I: 



k. 



I ~ I, 



(7) 



These are the parton momenta that normally appear in 



with the errors being small compared with the large com- 
ponents of k and I. (For example the transverse momen- 
tum might be of order M, compared with order Q for the 
large components.) 

The standard parton-model approximation is obtained 
by neglecting the small momentum components, k~ , kt, 
l + , It, i.e., by replacing k and I by k and /. However, 
the replacement is only applicable in the hard scattering, 
where we can neglect the small momentum components 
with respect to Q. It is incorrect to replace k by k in 
<!>(£;, P) and I by I in J (I) because internal virtualities 
in $ and J may be small. But it is valid to replace k + 
by the fixed value xb)P + inside the lower bubble, and to 
replace fc~ + q~ by q~ inside the upper bubble. These 
give only small fractional shifts in the large components 
k + and fc~ + q~ , and they give small fractional shifts 
in the lines' virtualities. Furthermore, we can perform 
a small Lorentz transformation to set to zero the trans- 
verse momentum entering J^(k + q). After this we change 
variables for the k integral to k~, k t and l + to obtain a 
factorized approximation to W^ v (q,P): 



J 



T PM W^(q,P) 



dk-d 2 k t 



<S>+{x Bi P+,k-,k t ;P) 



dl+Tr^j+^j-) J-(l+,q- ,0 t ) 



(8) 



The symbol Tpm, which we call the "parton model ap- 
proximator" , represents the operation of replacing the 
integrand in Eq. (J3J) by the integrand in Eq. ©. 

At this stage, we should emphasize a distinction im- 
portant for a more detailed treatment of final states. 
While the approximations, Eqs. (JTJ) , are clearly good in 
the hard scattering calculation if k~ and kt are small, 
the shift in integration variables needed to get Eq. ([5]) 
introduces errors in the evaluation of $(fc, P) and J{1) 
that need to be examined more carefully. Within the 
parton-model region of collinear quark momentum, the 
integrand in ||HJ) is a good numerical approximation to 
the original integrand, if it is a smooth enough func- 
tion. However, because it involves replacing final-state 
momenta by somewhat different momenta, the approxi- 
mation will change certain kinds of cross sections differ- 
ential in the final state. 

Even when we only treat inclusive cross sections, the 
changes in the kinematics affect the positions of thresh- 
olds. Indeed, the approximated nonperturbative factors 



$ + (xBjP + , k~, k t ; P) and J~{1 + , q~ , t ) no longer re- 
strict k to the actual kinematically allowed values of the 
original unapproximated integral. Therefore, the approx- 
imations leading to Eq. ([5]) can lead to unphysical results 
[27l | , particularly if one is interested in the details of the 
final state. 

For purely inclusive DIS, the usual formalism in- 
cludes higher order corrections that provide extra large- 
transvcrse-momentum jets, with Eq. ([5]) corresponding 
to the first term in a perturbative expansion of the hard 
scattering. Higher-order terms in the hard scattering in- 
clude terms that can compensate for kinematic approxi- 
mations that are particularly bad at large k t . 



B. Parton density and Wilson lines 

We now use the approximation Eq. ([5]) to explain a 
definition of a parton density. First, we recognize that 
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3> + (xBjP + , k , kt, P) may be written as 

<S>+(x Bj P+,k~,k t ,P) = jTr[~f+<i>(x Bj P + ,k-,k t ,P)] , 

(9) 

which suggests the following definition for the quark PDF 



fj( x Bj) 



dk-d 2 k t 



-Tr 



(10) 

The overall numerical factor of 1/2 is the standard con- 
vention; it ensures that the PDF has exactly the nor- 
malization of a number density, at least in field theories 
where light-front quantization is non-problematic. 

Now, the integral on the right-hand side is UV- 
divergent in a renormalizable theory like QCD so, as it 
stands, Eq. (flTj|) is ill-defined and needs to be replaced 
by something else. The divergence comes from regions 
of the integral where k" and k% are large, i.e., from val- 
ues of k that are far from parton kinematics. This is the 
domain where higher-order corrections to the hard scat- 
tering are important, so it is appropriate to modify the 
definition while preserving its treatment of the parton- 
model region. 

One possibility is to place some sort of UV cutoff on 
the integral in Eq. (|10[) near the hard scale, k~,k t < 
Q [28j]. While physically plausible, such a definition has 
problems when one tries to make it gauge invariant — the 
same problems that we will have to solve in our improved 
treatment with parton correlation functions. 

The solution that is in fact used for normal QCD fac- 
torization, and that corresponds exactly to what is done 
with the operator product expansion for DIS, is to apply 
UV renormalization to the bilocal operator. With the in- 
sertion of appropriate Wilson line operators, which give 
gauge-invariance, we get the usual definition [2| 



/i(XBj,/i) = 



dw 
An 



x (p|^(0,i i ;-,O t )KlK J )7 + ^Kj)V'(0)b)fl. (11) 

Here, ^(w) is the field operator for quark j, and \p) 
is the proton state vector. The subscript, R, indicates 
that the operator is renormalized using ordinary UV- 
renormalization techniques. This definition reproduces 
the basic structure of the integral in Eq. (fT0"|) , but renor- 
malization removes the UV divergence with a renormal- 
ization scale, /i. However, a derivation of factorization 
must allow for graphs with extra gluon exchanges, as in 
Fig. [He). It is known that therefore in the operator defin- 
ing the parton density, there must be inserted a path- 
ordered exponential of the gluon field along the light-like 
direction joining the quark and antiquark fields, as in 
Eq. (fTTj) . This also makes the definition gauge-invariant. 
Deriving an appropriate generalization for a parton cor- 
relation function, where the separation of the two quark 
fields is no longer light-like, will be an important part of 
the present paper. 



We will find that we need a Wilson line that goes out 
to infinity from the origin along one line, not necessarily 
light-like, and returns along a nearby line to a point w. 
So as a general notation we define V w (n) to be a Wilson 
line from w to infinity in the direction, n: 

V w (n) = P expf-ig J dXn-A(w + Xn)j . (12) 

Here, the symbol P is a path-ordering operator. 

In Eq. (fTTj) . we use a light-like direction, i.e., we replace 
n by the vector uj = (0,1, Ot); the separation of the fields, 
W, is in the same direction. Thus in the combination 
Vw (wj)Vb(uj) the segments between w and 00 cancel, so 
that 



V^(«j)Vo(uj) =P 



o.xp| —ig J dXuj ■ A(Xuj) 



(13) 

Thus the Wilson line is simply along the straight line 
joining the quark and antiquark fields. 

If we set /1 ~ Q, the hard scattering can, as is well 
known, be usefully calculated as a power series in ot s {Q), 
which is small because of the asymptotic freedom of 
QCD. (However, the simple use of the perturbation ex- 
pansion breaks down at small and large x.) 



C. Quark fragmentation factor 

The last factor in Eq. ([5]) is an integral over a cut 
propagator. In the absence of UV problems a simple 
unitary argument shows that the integral over all values 
of l + is equal to the value obtained by integrating just the 
lowest-order term in the cut propagator. Equivalently, we 
may replace the final-state jet bubble as follows, 

/ 'di+iv(y7V7lr(i) 

1 > y dZ+Tr(Viyv) 2ttS+(P) 

Here the approximated momentum in J is I = 
(l + , q~ , Ot), and we have normalized the trace so that it 
corresponds to the trace in the calculation of a partonic 
cross section. Graphically, (|14p corresponds to replacing 
Fig. [IJb), where the full final-state jet is included, by 
Fig. [TJa), with the lowest order quark propagator. It is 
important to recognize that two assumptions are neces- 
sary for this identification to be justified — that it is valid 
to use the approximate momentum variable, I, in the up- 
per bubble, and that it is valid to allow the integrals in 
Eq. ijHJl to be unconstrained by kinematical requirements. 
These assumptions go beyond the use of on-shell parton 
kinematics in the hard scattering. 
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FIG. 2: The amplitude for 7*p scattering into two jets with 
fixed masses. 

There is a delta-function that forces I 2 = and hence 
l + = 0. We use I = (0,q~,0 t ) to denote the resulting 
light-like momentum, and then we have 

T PM W^(P,q) = J^^jzM^^Tr^h^)- 

(15) 

Projecting out the F 2 (xBj,<3 2 ) component produces the 
familiar expression 

F 2 (x B j,Q 2 ) ^ ^2<-:-rH\fjr H :\ . (16) 

3 

The diagrammatic representation of Eq. (JT5J) is the fa- 
miliar handbag diagram in Fig.QJa). In fact, Fig. Ufa) is 
the typical starting point for most pedagogical introduc- 
tions to a pQCD treatment of DIS (e.g., [29]), although 
we see now that the justification for using Fig. QJa) in- 
volves a number of non-trivial steps. 

For our further work it is important to emphasize the 
distinction between the approximation that one restricts 
attention to the generalized handbag formula, Eq. ([3]), 
and the set of approximations that lead from it to the 
standard parton- model formula Eq. (| 1 5[) . In writing 
down Eq. ([3]), the only approximation is to restrict to cer- 
tain topologies of graph, whereas to reproduce Eq. (fT5"|) . 
we made several very non-trivial kinematic approxima- 
tions [49l | . It is these later approximations that we will 
find we need to avoid. 



D. The Limits of Standard Kinematical 
Approximations 

The approximations at issue change the momenta of 
final state particles. This can give problems whenever 
cross sections are investigated that are differential in 
final-state jets, for example by producing final states that 
violate energy- momentum conservation [27|. In particu- 
lar, at large x, the true parton kinematics are strongly re- 
stricted, whereas Eq. (jHJ) has these restrictions removed. 

To see this more explicitly, consider a particular final 
state consisting of two outgoing jets, Fig. [21 of momenta I 
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FIG. 3: An event in which the collinear, on-shell matrix el- 
ement — the photon-quark vertex in this example — is ac- 
companied by initial and final state showers. (In the showers, 
solid lines denote both quarks and gluons.) 

and kx- Let the invariant masses of the struck quark jet 
and the spectator jet be Mj and Mx- The Mandelstam 
variable, s, is 

O 2 

s = (l-x)Af 2 + -^(l-x). (17) 

F x 

In the center-of-mass frame, the 3-momenta of p— k and I 
are equal and opposite, so that the internal parton trans- 
verse momentum obeys k 2 — If — k\ x . Thus 

s = (l° + k x f 

= (y/Mj + k 2 + l\ + JM X + k 2 + k 2 zX ^j . (18) 

Since M x ,Mj, I 2 , k 2 x > 0, we have 4fc ( 2 < s. Thus, from 
Eq. (|17[) we get a strict upper limit on the kinematically 
allowed values of kf , 

kl<^Ml + ^H-x). (19) 

When x is close to one, this limit is much less than Q 2 . 
This is in gross contradiction to Eq. ([S]) where the integral 
over k is unrestricted. Even if we apply renormalization 
at a scale [i ~ Q, this implies an effective cutoff of order 
Q, far above the actual kinematic limit. However, it is 
not sufficient to set fi ~ — x, since corrections to 

the photon vertex still have an external virtuality Q 2 , 
for which the scale \x ~ Q is appropriate. There is a mis- 
match of scales. Therefore, we have an example where 
Eq. ([H]) is inappropriate even for a totally inclusive pro- 
cess. 

There are further problems with using the framework 
of standard collinear factorization for observables that 
are differential in final state kinematics. This is illus- 
trated in Fig. [3] where the photon-quark vertex is now 
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accompanied by initial and final state showers. This sit- 
uation is appropriate not only for the discussion of jet 
cross sections, but also for the theory of Monte-Carlo 
event generators (MCEGs). Then the mass of the outgo- 
ing jet is given by 

m 2 j = (k + q) 2 = 2(k+-xP+) + JL-Vfcf, (20) 

so that 

* + = rf+ + IFTF)' < 21 > 

which is strictly greater than xP + . (Note that k~ is 
always negative.) This shows that away from collincar 
kinematics, there is a substantial inconsistency between 
the value of the longitudinal momentum used to evalu- 
ate the parton density, namely k + = xP + ~ x-Q- i P + and 
the correct value of k + . Since we must allow the trans- 
verse momentum and the parton virtualities to range up 
to large values, this represents a substantial shift in k + . 
The value of k + depends on both target-related and jet- 
related variables, so particularly difficult problems arise 
in constructing a systematic treatment of higher order 
corrections in a factorization framework with conven- 
tional PDFs, as explained by Collins and Zu @. Dif- 
ferent numerical values for the same quantity are used at 
different places in the formalism. 

The important conclusion of this section is that the 
steps that allow one to replace Fig. [T{b) with Fig. [TJa) 
introduce possibly large errors in certain types of calcu- 
lation. Since the kinematical approximations that allow 
us to replace Eq. ^ by Eq. ([5]) are what normally al- 
low us to replace the final state jet bubble in Fig. [IJb) 
with the on-shell massless parton in Fig. [lja) , then a uni- 
fied treatment must improve the approximations to avoid 
changing momenta in the final state. 

III. WHAT IS NEEDED 

In the last section, we argued that some of the approx- 
imations that lead to the parton-model formula need to 
be avoided, because they produce large kinematical er- 
rors that affect the measured final state. Furthermore, 
from the analysis of Libby and Sterman [3lj . we know 
that for QCD the correct starting point is the sum of 
regions represented by Fig. [He), and not just graphs (a) 
or even (b). In order to get a factorization formula and 
be able to perform perturbative calculations, we need to 
rewrite Fig. QJc) in a useful approximation as the product 
of a hard part that can be calculated directly with ordi- 
nary Feynman graphs for on-shell external partons, and 
a collection of universal parton correlation functions to 
describe the non-perturbative physics. Therefore, what 
is needed is a set of approximations and Ward identities 
that reduce Fig.QJc) to a factorized form, but without the 
problems with kinematics that we have just discussed. 



Each region of the form shown in Fig. [T](c) has a set of 
lines collinear to the target, subgraph <I>, a set collincar 
to the outgoing quark, subgraph J, a set of soft lines, 
B, and two hard subgraphs H.l and H.r (on the left and 
right of the final-state cut). (We use the symbol B for 
the soft bubble in a general graph. The symbol S is 
reserved for the soft PCF whose definition will arise when 
we discuss factorization.) For each region, we will define 
an approximator, in the same spirit as Eq. ([8]). Our aim is 
to find definitions of approximators that simplify as much 
as possible the systematic application of Ward identities 
without uncontrolled remainder terms. 



A. Requirements on approximators 

On the basis of the observations in the previous sec- 
tions, we propose that the approximators should obey 
the following: 

1. The kinematics of the initial and final states must 
be kept exact. Otherwise, large errors occur in cer- 
tain types of calculation. 

2. The bubbles representing the sums over physical fi- 
nal states must be kept explicit. 

For example, we must take Fig. []Jb) rather than 
Fig. QJa) as the starting point of the derivation of 
the parton model. It can be argued that the in- 
tegral over final-state bubbles such as J is unity, 
as in Eq. (fT4")l . But this involves a cancellation be- 
tween final states of different invariant masses, and 
this violates the first requirement. 

3. To avoid making kinematical approximations in the 
initial and final states, the non-perturbative factors 
need to be functions of all components of parton 
four- momentum. 

Hence all of the non-perturbative factors are fully 
unintegrated factors, rather than standard PDFs, 
i.e., they are PCFs. In addition to the fully uninte- 
grated PDF, we need to define a fully unintegrated 
soft factor and a fully unintegrated jet factor. 

4. The hard scattering matrix element should appear 
as an on-shell parton matrix element in the final 
factorization formula. 

• Setting on-shell the external partons of a hard- 
scattering subgraph involves no shift of the 
momenta of observable lines. Thus it is a safe 
choice, since it merely involves changing the 
numerical value of the integrand. 

• The use of on-shell and massless matrix ele- 
ments allows the use of already existing Feyn- 
man graph calculations. The only changes 
from the usual case concern the subtrac- 
tion terms to remove the double counting of 
collinear and soft contributions. 



9 



• This is the primary place where explicit 
higher-order calculations of Feynman graphs 
are actually used. These calculations are 
much easier when on-shell and massless. 

• It is much easier to maintain gauge-invariance 
in on-shell amplitudes than in off-shell ampli- 
tudes. For the PCFs, we make gauge invariant 
quantities with the aid of Wilson line factors 
in their definitions as matrix elements of op- 
erators. But this is much harder to do in the 
hard-scattering coefficients unless their exter- 
nal parton lines are on-shell. 

5. We must be able to apply Ward identities exactly 
to the approximated graphs, in order to convert the 
gluon exchanges in Fig. [TJc) to a factorized form. 

• Any approximation on momenta inside the 
hard scattering matrix element should be con- 
sistent with the use of Ward identities. In the 
process of factorizing soft and collinear gluons, 
it will be important to identify contributions 
to the PCFs. The resulting constraints on the 
organization of the approximations will lead to 
corresponding constraints on the definitions of 
the PCFs. 

• It is easy to get a situation where Ward 
identities are applicable only with remainder 
terms that are of nonleading power in what we 
will term the core region of an approximator. 
These are typically of leading power when the 
integrations are extended, as is always neces- 
sary, to a full range of kinematics. As far as 
possible, therefore, the approximators should 
be arranged so that the remainder terms are 
exactly zero. Otherwise, explicit treatment of 
the remainder terms is needed to get factor- 
ization beyond a leading logarithm approxi- 
mation. 

6. Each approximator should give a good approxima- 
tion in a particular region of momentum space, but, 
for the purpose of proving factorization, should be 
well-defined for all momenta for which it is used. 

• In the context of a systematic subtraction 
scheme — Sec. |VI] — applied to all orders, 
an approximator Tr for a region R is used to 
provide a good approximation to a graph in 
region R, with errors suppressed by a power 
of A/Q in the core of the region. (A is a char- 
acteristic hadronic mass scale.) 

• For a larger region R%, we will apply its ap- 
proximator to the graph with contribu- 
tions from smaller regions subtracted, to com- 
pensate for double counting. 

• In order for this procedure to work, we need 
to take as the contribution of the region R the 



integral of its approximation up to where the 
error becomes of order 100%. 

• Thus it must be possible to extend the for- 
mula for TrT beyond the core of the region 
R. Therefore, its definition cannot assume the 
momenta are in the core of the region. 

In the next section, we will address the first four points 
by returning to Fig.[]Jb) and demonstrating how the hard 
scattering part of the graph may be approximated with- 
out violating initial and final state kinematics. To ad- 
dress the last three items, we will discuss the definitions 
of the PCFs. There we give a set of candidate definitions 
for the PCFs. Having completed this, we will be in a po- 
sition to approximate Fig. [He) to produce a factorization 
formula in terms of the lowest order on-shell-parton hard 
scattering amplitude and the PCFs of Sec. [Vj 



B. Collinear Gluons in the Standard, Integrated 
Treatment 

To clarify the strategy for dealing with soft and 
collinear gluons, it is useful to recall relevant steps for 
obtaining the appropriate gauge-invariant definition of 
the fully integrated PDF, Eq. (fTTj) . with LO factoriza- 
tion in the standard formalism. Relevant graphs beyond 
the standard parton model graph Fig. [IJa) are those of 
the form Fig.|4ja), where we extend the handbag diagram 
to allow for an arbitrary number of gluon exchanges con- 
necting the lower bubble with the outgoing quark. We 
restrict the extra gluons to be collinear to the target. To 
leading power, the gluons are longitudinally polarized. 
We thus have a special case of Fig. Hfc). By the use of 
a simple Ward identity, it is seen that the gluon attach- 
ments eikonalize, and may be converted into a Wilson line 
factor, as illustrated in Fig.fJJb). This leaves the convolu- 
tion product of the on-shell LO parton scattering ampli- 
tude with the gauge- invariant PDF given in Eq. (TTTj) . In 
the light-cone gauge, A + = 0, the Wilson line operator is 
unity and we exactly reproduce the most naive graph for 
lowest order hard scattering, Fig. []Ja), and graphs with 
extra collinear gluon exchanges are power suppressed. 

As we will see, a number of complications arise in ex- 
tending these ideas to deal with more general cases. Our 
aim is to make a precise, general-purpose formalization 
suitable for the generalization. 



C. Soft and Collinear Gluons in the Generalized 
Formalism 

Since we do not make any kinematical approximations 
on initial- or final-state momentum variables, our gen- 
eralization of the handbag diagram will contain all final 
state bubbles. In addition to the initial-state-collinear 
gluons, we also need to show that final-state-collinear 
and soft gluons also factorize into appropriately defined 
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FIG. 4: Target collinear gluons explicit in the definition of 
the standard integrated PDF. 



gauge-invariant PCFs. We will find that after appropri- 
ate approximations, we can apply Ward identities that 
disentangle the coupled subgraphs in Fig. QJc) to give 
the factorized form shown diagrammatically in Fig. I24f c) 
below, up to power suppressed corrections. After some 
further manipulations to compensate for double count- 
ing, we find that the cross section, a (or a related object 
like a structure function), is a convolution product of a 
hard factor, a PDF, a final state jet factor, and a soft 
factor: 



C®F(g)J®S + 



(22) 



Equation (|22jl establishes our notation for the PCFs - 
F for the target PCF, J for the jet PCF, and S for the 
soft PCF. The notation should be carefully distinguished 
from the notation J, and B) that we have used so far 
in discussing the subgraph bubbles in a particular graph. 
The last term in Eq. (j22"|) indicates that errors should 
be suppressed by a power of A/Q where A is a typical 
hadronic mass scale and the power is a > 0. 



IV. THE BASIC APPROXIMATION 

In this section we reexamine and reformulate the par- 
ton model approximation, as appropriate for Fig. [ljb), 
in a form suitable for our later work. We arrange to 
use PCFs rather than regular parton densities. As re- 
gards parton kinematics, a suitable definition was given 
by Collins and Zu Q. We now extend this to convenient 
projections onto appropriate two-dimensional spaces for 
on-shell massless Dirac spinors. Although the calcula- 
tions are quite elementary for tree graphs, a precise for- 
mal definition with a convenient graphical notation will 
greatly assist later work with higher order graphs. 

Since we normally work with scalar structure func- 
tions, F\ etc, we define a projection tensor P^ for any 
chosen structure function. Thus, the projection onto F\ 
is done by 



Pn 



{p ■ q y + miq? 



(23) 



so that F x = P^W^(q,P). 

We now apply such a projection to Eq. ([3]): 



P 



47T 



e?Tr 



4 3 



J(l)j»$(k,P)j v 



(24) 



From here on, a sum over quark flavors, j, is implicit. 

The first step is to replace exact parton momentum 
variables with approximated parton momentum variables 
(indicated with a hat) inside the hard matrix element: 



k t— > k, I f — ► I. 



(25) 



These are the same as in our treatment of the conven- 
tional parton model; but now we no longer use the other 
kinematic approximations in the J and $ factors. The 
approximated momenta, defined in Eq. ([6|) , form a partic- 
ular case of the prescription in Ref . [6j . They are uniquely 
determined by the following requirements. First, the ap- 
proximated momenta, k and I, describe a collinear struck 
parton and an on-shcll final state parton, 



Iktl =k~ =0, 



P = 0. 



(26) 



Then we require four-momentum conservation for both 
the exact and approximated variables: 



k + q = I, k + q = I. 



(27) 



Next we formalize the projection onto the leading 
power terms in the trace over Dirac matrices by defin- 
ing projection matrices Vt and Vy. 



Pt = \p" 



7 



•p 1 + - 



where 



7 



-(7° 



7 3 ), 7 + = - 7f (7° + 7 3 ) 



(28) 



(29) 



As elsewhere, we use the subscript T to denote target- 
related quantities, and J to denote jet-related quantities. 
Some properties of the projection matrices are listed in 
App. El It can be readily checked that Vt projects a 
general 4-dimensional spinor onto the 2-dimensional sub- 
space of spinors u that obey the massless Dirac equation 
for momentum k, i.e., $u — 0. Similarly, Vj projects 
onto spinors for momentum I. 

We next use the decompositions (0} of J{1) and 
$(/c,P) in a basis of Dirac matrices. The terms that 
appear at the leading power in the trace can be pro- 
jected out by sandwiching $ and J between projection 
matrices: 

V T ®(k, P)V~^^V T $(fc, P) Vi 

= 7~$+ - <j- j <5> +j + 7~7 5 $! , (30) 
VjJ{1)Vj=VjJ{1)Vt 

= 1 + J~ - o +J J- J + 7 + 75 Jf ■ (31) 
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Using them, we replace Eq. (j24|) with its parton-model 
approximation 



P, 



47T 



x e^Tr 



dk+dk-d 2 k t 
(2^) 4 



(32) 



which is changed from its previous definition. The er- 
rors incurred by making this substitution are power sup- 
pressed. We now restrict to the case of unpolarized scat- 
tering, for simplicity, in which case only $ + and J~ ap- 
pear, so that Eq. (|3"2"|) can be rewritten as 



(33) 



t rrwi P ^ f dk+dk~d 2 k t 

ipMl [W\ = — 

4-7T 



x e^Tr 



(2tt) 4 

^ + J-{l)^^-^ + {k,P) 7 " 



Dividing and multiplying by k + = XBjP + and I = q 
we obtain 



J t + q 



FIG. 5: A schematic representation of the approximation per- 
formed in Eq. ((36)) . 



the final state. This is to be contrasted with the situation 
in Eq. ©. 

The symbol, Tpm, is the "approximator" which acts 
on Eq. (|24|) to produce the approximation in Eq. (|36)l . 
Graphically, we depict the operation of Tpm by a circle 
around the electromagnetic vertex, as in Fig. [5] It is 
defined as follows: 

• Everything outside the circle is left unapproxi- 
mated. This is the essential change from the stan- 
dard parton model. 



PM 



P 

r = M ' y 

47T 



d A k 



(2ir) i k+l- 



x e^Tr 



t-ft-f 



<f> + {k,P)J-(l). (34) 



This gives a good approximation to T so long as the in- 
tegral is dominated by the region where k~ , |k t | <C k + . 
The lowest order hard matrix element squared is imme- 
diately identifiable, and we define it as 



\H (q,k,l)\ 



1 



(35) 



The factor 1/2 ensures that this is normalized just like 
the Born graph for scattering on a spin-averaged mass- 
less quark. This definition then entails a factor of l/l~k + 
outside the hard scattering amplitude. Thus, the hard 
scattering is evaluated with the on-shcll parton ampli- 
tude, while the momentum used to evaluate the PDF and 
the jet factor remain exact. We write the approximation 
as 



TpMr 



i 

2^ 



d 4 fc 



(27T) 4 fc+/- 



x \H (q, k, 0| $ + (k,P)J-(l). (36) 



In the hard matrix element in Eq. (|36[) . the hatted ap- 
proximate variables should be regarded as functions of 
the exact variables. All the nonperturbative objects (the 
PCFs) are evaluated with unapproximated momentum 
variables. Thus the only kinematical approximation is in 
the evaluation of the hard matrix element. 

Finally, notice that at this order there is no ultraviolet 
divergence corresponding to |fc 2 | — > oo because of the 
kinematic constraints and positive energy condition on 



• The label T next to the circle symbolizes where ap- 
proximations appropriate to the initial-state quark 
are made: 

— Parton momentum k is replaced inside the cir- 
cle by fc, thus projecting it onto the plus di- 
rection. 

— The projection matrix Vt is applied in Dirac 
spinor space. 

— In the complex conjugate amplitude, to the 
right of the final-state cut, the projection ma- 
trix is Vt = Pi- 

• Similarly, the label J symbolizes where approxima- 
tions appropriate to outgoing quark are made: 

— Parton momentum I is replaced inside the cir- 
cle by I, thus projecting it onto the minus di- 
rection. 

— The projection matrix Pj is applied in Dirac 
spinor space. 

— In the complex conjugate amplitude, to the 
right of the final-state cut, the projection ma- 
trix is Pj = Pt- 

The approximations that lead to the factorized form in 
Eq. (|36p are shown diagrammatically in Fig. [6l In con- 
trast to the standard formalism for DIS, the momenta 
used to evaluate <& + (fc, P) and J~{1) in Eq. (|36[) are ex- 
act. Furthermore, the integrals over k are constrained by 
the positive energy condition and relativistic kinematics, 
in contrast to Eq. |3)). 
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FIG. 6: The approximation of Eq. fl36j. 



V. DEFINING PARTON CORRELATION 
FUNCTIONS 

A major complication in developing a generalized 
treatment of DIS is in the difficulty of giving appropriate 
and consistent gauge invariant definitions for the PCFs. 
In the absence of gauge fields, the definition of the PCFs 
is clear Q: they are just the obvious matrix elements of 
quark fields, Fourier transformed into momentum space. 

In a gauge theory, there are two sources of complication 
that are intimately tied to each other: (a) There are im- 
portant leading contributions from regions like Fig. [He), 
where gluons connect subgraphs that correspond to very 
different ranges of rapidity, (b) The operators in the defi- 
nition of the PCFs must be made gauge-invariant. As we 
will see in the later sections, the gluon connections can 
be converted to a factorized form by applying certain 
approximations, after which Ward identities are used to 
show that the sum over the gluon exchanges corresponds 
to contributions associated with Wilson lines inserted in 
the partonic operators in the definitions of the PCFs and 
PDFs. 

The Wilson lines give gauge-invariant definitions, and 
there is, a priori, a choice in the path used in the integral 
over the gluon field in the Wilson lines. However, only 
certain directions are suitable, i.e., consistent with fac- 
torization. Evidently, a complete discussion can only be 
made within the context of a treatment of factorization 
of soft and collinear gluons as will be described below in 
Sec. I Villi But given that a definition has been found or 
proposed, its properties can be discussed separately from 
the motivation. 

The characteristic difficulty is that the most obvious 
definitions have divergences. This applies not just to the 
PCFs but to PDFs as well. There are three basic sources 
of divergence: 

1. UV divergences due to integration to infinite 
transverse momentum: These appear in graphs 
for ordinary integrated PDFs in all renormalizable 
theories. They also appear in virtual graphs con- 
tributing to unintegrated PDFs and PCFs. In 
all cases they can be removed by renormalization 



counterterms, beyond those needed to renormalize 
the interactions of the theory. 

2. Divergences due to the masslessness of the 

gluon: These appear in Feynman graphs, but pre- 
sumably are cut off by confinement effects in real 
QCD when the PCFs and PDFs are treated non- 
perturbatively in hadronic targets. 

At this stage of our work, we will use a model 
Abelian gauge theory with a nonzero gluon mass, so 
that we separate the mass divergences from other 
issues. Given that color is confined in QCD, we can 
expect a real physical cutoff of these divergences, 
which however is difficult to discuss within pure 
perturbation theory. 

3. Divergences due to the use of light-like Wil- 
son lines: We will call these rapidity divergences. 
In simple Feynman-graph calculations, rapidity di- 
vergences are frequently confused with the diver- 
gences due to the masslessness of the gluon, since 
both arise from regions where the plus or minus 
momentum of a gluon goes to zero. 

We will discuss the issues starting with fully integrated 
PDFs. Then we will examine the intermediate case of un- 
integrated PDFs, which are differential in k + /P + and k t . 
Finally we will examine the case of the fully unintegrated 
functions, the PCFs, of all the three types we will need: 
target related (like an ordinary PDF), jet related, and 
soft factor. 



A. Integrated PDFs 

For the fully integrated PDFs, the definition || given 
in Eq. (|11[) is entirely satisfactory. The primary parton 
fields have a light-like separation in the minus direction 
so that the Wilson line can be taken along the line join- 
ing the parton fields, as is needed to be consistent with 
factorization. UV divergences are canceled by renormal- 
ization, and there is a cancellation of all the rapidity 
divergences and of the final-state divergences associated 
with masslessness of the gluon. In a color singlet hadron 
with confinement there should be no other divergences. 



B. Unintegrated PDFs 

As for a simple unintegrated PDF, a common approach 
is to define it as a matrix element of parton fields without 
a Wilson line: 

J 167T 3 

x (p|^(0,u.-,w t )7 + ^(0)|p)i cg , (37) 

but with the fields defined to be in the light-cone gauge 
A + = 0. This definition is natural, because if the rules of 
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FIG. 7: (a) Range of rapidity for exchanged gluons in DIS. Rightmost is the target region, and leftmost is the outgoing jet 
from the struck quark, (b) Rapidity for gluons in PDF defined with light-like Wilson line (or for PDF defined in light-cone 
gauge without Wilson line). 



light-cone quantization are applied naively, the definition 
becomes exactly the number density of quarks. However, 
as Collins and Soper Q demonstrated, this definition has 
a rapidity divergence in the interacting theory. 

The rapidity divergence, from gluons whose rapidity 
goes to — oo, occurs even when all ultraviolet and mass 
divergences are cutoff. As illustrated in Fig. [3 the impor- 
tant range of gluon rapidity for the actual cross section is 
between the rapidity of the target and the rapidity of the 
jet. Naturally gluons to the right of the rapidity of the 
virtual photon can be considered as associated with the 
PDF, and gluons to the left as associated with the jet's 
fragmentation function, while gluons in the center belong 
to a soft factor, in the Collins-Soper-Sterman (CSS) fac- 
torization method But the use of light-cone gauge 
gives contributions to the PDF from gluons not merely 
from the positive rapidities that are naturally part of 
the PDF, but also from rapidities running all the way to 
— oo. Evidently, the parton density needs to be redefined 
so that it has some kind of rapidity cutoff at around the 
photon's rapidity. This can be accomplished by Soper's 
7] definition with a non-light-like gauge-fixing vector n 
of about the same rapidity as the virtual photon. Collins 
and Soper (CS) derived an equation for the derivative 
with respect to the direction of n; they used this equa- 
tion to show very generally the existence of a rapidity 
divergence in the light-cone-gauge limit n 2 /(n ■ p) 2 — > 0. 

The definition in Eq. ([57)1 is readily converted to a 
gauge-invariant form by using the fact that a Wilson line 
operator V w (n) in direction n is unity in the n ■ A = 
gauge. But a further complication is uncovered when the 
definition is made exactly invariant: 

P(x,k t ,v) = I ^L^ e -^ + ™-+** w t 

J 167T J 

x (p| ? /;(0, U ;-,w t )V^(n)/ n , u) ^7 + 14.(«)V'(0)b). (38) 

Here Wilson lines go out from the parton fields to infin- 
ity in the direction n. With only these, we would get 
exact agreement with Eq. (|3T|) in n ■ A = gauge. But 
as was pointed out by Belitsky et al. [13], strict gauge 
invariance requires that the Wilson line be completed in 
the transverse direction by a segment connecting the two 
points at infinity. This is accomplished in Eq. (|38p by a 



factor of the following operator 

I n , w ,o = Pexp(-ig J dsfA^zyj , (39) 

where the contour C is in the transverse direction and 
connects (0, oo, w+jto (0,oo,0 t ). 

Belitsky et al. [32| showed how this gauge link at in- 
finity is essential to get correct physics even in n ■ A = 
gauge; the naive definition Eq. (|3"Tj) is wrong. Their 
demonstration of the failure of Eq. (|3T|) was done in 
the context of one-gluon-exchange calculations of the 
Sivers function; this is the single-spin-asymmetry (SSA) 
of the unintegrated parton density. In Feynman gauge 
the Sivers function is obtained from from an imaginary 
part associated with the usual part of the Wilson line, 
and the gauge link at infinity does not contribute. But 
in light-cone gauge, the total contribution comes from the 
link at infinity. The naive definition Eq. (|37[) gives zero 
for the Sivers function, as observed by Brodsky, Hwang, 
and Schmidt [2j|. 

For this particular calculation of the SSA, the prob- 
lem with rapidity divergences when n is light-like did 
not appear, so the calculation in [32] was done in light- 
like gauge. But the problem of rapidity divergences does 
appear when more gluons are exchanged, and it does ap- 
pear with one-gluon exchange in the unpolarized case, 
as is verified by explicit calculation [10']. Furthermore, 
couplings of multiple gluons to the link at infinity should 
affect the unpolarized parton density in axial gauge. 

A satisfactory solution to all the difficulties is there- 
fore to use Eq. (|38[) as the definition of an unintegrated 
quark density, with now a non-light-like vector n for the 
Wilson lines' direction and with a gauge-link at infinity. 
Then n being non-light-like cuts off the rapidity diver- 
gence, and the CS evolution equation gives the effect of 
changing the cutoff. Apart from the gauge-link at infin- 
ity, the definition is exactly that of CS [2|. Presumably 
the complications made evident by the calculation of the 
Sivers function also infect the CS definition, which makes 
no allowance for the gauge link at infinity. 

Unfortunately, the use of a non-light-like vector to set 
the gauge condition or the directions of the Wilson lines 
produces some practical complications: Feynman graph 
calculations are algorithmically harder than when a light- 
like vector n is used, and the evolution equations have 
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inhomogeneous terms that are difficult to discuss explic- 
itly. The inhomogeneous terms are of non-leading power 
in the hard scale Q, so that they are neglected in phe- 
nomenological applications. The appearance of the non- 
leading powers in Q, however, suggests that the use of 
non-light-like Wilson lines might be preferred in studies 
of higher- twist effects (see, for example, the work of 
Collins, Hautmann and Metz 

[111 Sail 

suggested mod- 
ified definitions. They take the definition with light-like 
lines as basic. But to cancel rapidity divergences, it is 
divided by an extra factor involving the vacuum expec- 
tation value of certain Wilson lines. The non-light-likc 
vector needed to specify the physically necessary rapid- 
ity cutoff is in this extra factor. This represents a kind of 
generalized renormalization of the operators whose ma- 
trix element is the parton density. But we will not take 
this route in the present paper. 



C. PCFs 

Parton correlation functions (PCFs) are defined like 
parton densities, but without any of the integrals over fc t 
or k~ . The primary issue in constructing a definition is 
the choice of directions of Wilson lines. So we start by 
discussing what leads us to our choice. The issues are 
closely linked to those of rapidity divergences and the 
important regions of gluon rapidity relevant for a process. 

Compared with the case of PDFs (integrated or uninte- 
grated) some simplification occurs because we remove the 
integral over parton virtuality (or k~). Any rapidity di- 
vergence occurs when the rapidity of a gluon goes to — oo. 
At fixed transverse momentum, the divergence therefore 
occurs where the gluon's minus momentum goes to infin- 
ity. For emission of real gluons in a PCF, the divergence 
is therefore cutoff, because the minus momentum on any 
one gluon is restricted to P~ — k~ by the externally im- 
posed k~ . This occurs even if the gluon is "dressed" so 
that it decays to multiple particles. This contrasts with 
the case of an ordinary PDF, integrated or unintegrated, 
with its integral over all values of k~ . 

Although the use of exact parton kinematics in a PCF 
cuts off rapidity divergences when the external transverse 
momentum is fixed, it does nothing about virtual gluons. 
Moreover, when we work with Feynman graphs in a the- 
ory with massless gluons, the limit on rapidity expands as 
the transverse momentum is reduced to zero. So rapidity 
divergences reappear as part of the infra-red divergences. 
We regard rapidity divergences as particularly dangerous 
because a derivation of factorization associates the PCF 
or PDF with momenta that are related to the target. A 
rapidity divergence gives important contributions from 
momenta that are infinitely far away, thereby at least en- 
dangering the value of treating parton kinematics more 
exactly. 

Therefore we define all our PCFs with non-light-likc 
Wilson lines. There are three types: (a) A PCF in a 
target, generalizing the notion of parton density, (b) A 



corresponding object, a jet PCF, for fragmentation, gen- 
eralizing the notion of fragmentation function, (c) A soft 
PCF factor. These will be used to capture the physics 
associated with partons whose momenta are respectively 
in the regions: (a) target-collinear, (b) jet-collinear, (c) 
soft. 

For the target and jet PCFs, the direction of the Wil- 
son line is chosen to be approximately at rest in the 
center-of-mass, so as to correspond to a separation be- 
tween left- and right-moving momenta, as is natural for 
factorization. (For our purposes, we have generalized the 
notion of center-of-mass to any four-vector with zero ra- 
pidity so that it is applicable to space- like as well as time- 
like vectors.) As for the soft factor, it concerns gluons 
central in rapidity, and is defined in terms of a vacuum 
expectation of a suitable Wilson line operator. The Wil- 
son line has segments representing the active partons, 
and that therefore go in almost light-like directions. 

There will be an apparently unfortunate profusion of 
directions of Wilson lines. The associated complications 
are manageable once one recognizes that the directions 
provide rapidity cutoffs in each factor to ensure it only 
concerns momenta appropriate for the given factor. The 
factors do acquire extra arguments corresponding to dif- 
ferences between the rapidity cutoffs and the external 
momenta. But then the CS equation for the dependence 
on the Wilson line directions gives the dependence on the 
extra arguments. Thus the non-light-likc Wilson lines 
provide a tool for quantifying the evolution with respect 
to the available rapidity range, and hence with respect to 
energy. The correspondence between rapidity and angle 
for a massless particle is presumably the link that re- 
lates Collins-Soper evolution to the well-known angular- 
ordering rule for coherent gluon emission in parton show- 
ering. 

We leave open the possibility that the definitions may 
be replaced with definitions that use light-like Wilson 
lines with rapidity divergences removed by gauge invari- 
ant factors as in the treatment of the Sudakov form factor 
in Refs. [HI, [36J, but that is left for later work. 

The observations made so far in this section motivate 
our new definitions for the PCFs. In this section, we will 
simply state the definitions with qualitative remarks to 
motivate them. The justification for the definitions will 
come when we find we can set up a detailed set of ap- 
proximations which result in factorization with our given 
definitions for the PCFs. 

First, we define light-like vectors corresponding to the 
directions of the primary hadrons in the process: 

ur = (l,0,O t ), uj=(0,l,O t ). (40) 

We also define slightly non-light-like vectors, 

n T = (l,-e- 2 ^,0 t ), nj = (-e-l 2w l,l,0 t ), (41) 

with j/t large and positive, yj large and negative. Our 
notation is to use the letter u for light-like vectors, and 
n for non-light-like vectors; the subscript indicates which 
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direction of a hadron or an active parton is approximated 
by the vector. To achieve factorization, we will also need 
a vector to characterize the boundary between left and 
right-movers; it should correspond to the gauge- fixing 
vector in the Collins-Soper 0, Q formalism. Therefore 
we define: 

n s = (- e y>,e-y%O t ). (42) 

In accordance with the results of Collins and Metz [35j ] . 
we use space-like, not time-like, vectors. They found that 
when virtual gluon emission is included, space-like Wil- 
son lines give the widest and most universal factorization. 

The above vectors will appear as directions of Wilson 
lines in the definitions of PCFs, and they serve to pro- 
vide cutoffs on the rapidities of momenta in each PCF. 
Thus, each PCF is restricted to rapidities appropriate to 
its function in a derivation of the factorization property. 
During the derivation we will require that the rapidities 
2/t, Vs and yj correspond approximately to the target, a 
rest vector in the center-of-mass, and the outgoing jet. 
(The precise values will not be relevant.) After we have 
a factorization, we will wish to exploit the universality of 
the PCFs to relate processes at different energies. This 
will involve, for example, boosting the target PCF to 
change the target state from one energy to another. The 
boost will also apply to the vector n s , thereby giving it 
an inappropriate rapidity for proving factorization at the 
new energy. The CSS equation gives the dependence on 
y s , so that we can convert the PCF to the one appro- 
priate to the new energy. Thus, although a factorization 
proof can legitimately assume that y s — 0, i.e., that n s 
can be considered at rest in the center-of-mass, we leave 
y s as a parameter because we will need to exploit the 
2/s-dependence of the PCFs. 

In a subtraction scheme, we start the treatment from 
the smallest region and successively generate terms for 
larger regions, with subtractions to avoid double counting 
with previously encountered regions. Since the region 
of soft momenta is smaller than the regions of collinear 
momenta, it is sensible to define the soft factor first, but 
with some care to ensure compatibility with what later 
will be found to be appropriate for the definition of the 
PDF and the jet factor. 

1. Soft factor 

Soft gluons couple to the target jet, with its large plus 
component of momentum, and to the outgoing jet with 
its large minus component of momentum. This suggests 
that the soft factor is the vacuum expectation value of 
Wilson lines that are nearly light-like in the plus and 
minus directions. In coordinate space we define the soft 
factor by 

S(w,y T ,yj,n) 

= (0|4 T; »,o^K)^(n J )7„ j;i „,oK,(n J )^ t (nT)|0)H. 

(43) 




FIG. 8: Structure of the Wilson loop that appears in the 
definition of the soft factor (1 43 II . Dot-dashed lines indicate 
exactly light-like directions. 



This expression has non-light-like Wilson lines going out 
in approximately the plus and minus directions from a 
particular point in spacetime which we may choose as the 
origin of our coordinate system, times a conjugate am- 
plitude with emission from a different point w. Fourier 
transformation then gives a factor for the production of 
a final state of a given momentum. This represents emis- 
sion from outgoing cikonalized colored lines in directions 
appropriate to the quarks k and k+q in the parton model. 
That the Wilson lines are not quite light-like restricts the 
states to those appropriate for a finite energy process. 
They also provide cutoffs on rapidity divergences. 

Now choosing the n.j Wilson line to be outgoing natu- 
rally matches the idea that gluon radiation from this line 
concerns emission from the actual k + q line at the hard 
scattering. This suggests that, to match the incoming k 
line, the direction for the tit Wilson line should be in- 
coming from — oo rather than outgoing to +00. The work 
of Collins and Metz [35[ shows otherwise: the choice of 
an outgoing line (with color corresponding to an anti- 
quark, if the k line is an incoming quark) turns out to 
work better and to give broader universality properties. 

The paths for these Wilson lines are illustrated in 
Fig. [51 where we also indicate the exactly light-like di- 
rections. The gauge links, I„ T . W and I n .y,w,o, at infin- 
ity are needed for strict gauge invariance. It should be 
noted, of course, that the lines representing these gauge 
links in Fig. [5] should have components in the transverse 
direction (out of the page). Thus, the soft factor is the 
vacuum expectation value of a closed Wilson loop. 

Renormalized field operators are used in Eq. (|43| . In 
a renormalizable theory, Eq. (|43[) will contain UV diver- 
gence, both from the divergences that appear in the La- 
grangian, and from the fact that Eq. (|4*3"j) involve Wilson 
lines meeting at a cusp. Both types of divergence can 
be dealt with using standard renormalization techniques 
(as indicated by the subscript, R). The renormalization 
scale is /i. 
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FIG. 9: Path of the Wilson line (in a highly boosted frame) in 
the definition of the parton correlation function in the target. 
(The fully unintegrated PDF.) 



2. Target PCF 

The target PCF for a quark should involve a gauge- 
invariant expectation value of the quark fields inside the 
target. Therefore, a reasonable first attempt at a defini- 
tion (in coordinate space) is 

F(w,y p) y s ,fj,) 

= (p\#(w)VZ(n B )I n .. tW , ?2 VoK)V(0)b)R. (44) 

Here the Wilson lines are in direction n s , and in prov- 
ing factorization we will assume that the rapidity y s — 
h In /nj | is close to zero in the center-of-mass, i.e., 
that n s is approximately along the — z direction in the 
center-of-mass. That n s is s pac e- like is obtained from the 
results of Collins and Metz 



351 ] . As we explained earlier, 



exhibiting the dependence on y s allows the use of the CSS 
evolution of the PCFs. The above definition is compat- 
ible with the CS definition, where n s is the gauge-fixing 
vector for the axial gauge n s ■ A = 0. For exact gauge 
invariance we have also inserted a gauge link at infinity. 
The path for the complete Wilson line is shown in Fig. [5] 
The factor of 1/2 with 7+ is to keep the normalization 
the same as for a PDF. 

Although this is an excellent definition, we will find a 
different quantity arises when we first obtain a factoriza- 
tion. We will find gluonic effects that appear both in the 
soft PCF factor and this definition of the target PCF. 
To remove the double counting, we will use a related def- 
inition which our first proposal, Eq. (|44[) . divided by a 
factor related to the soft PCF: 

F mod (w,y p ,y T ,y s ,fi) 

(p|^H^(n.)J n ..„,o^-^o(n.)^(0)|p)ji 

~ (0\ll T . w0 VUnT)VUn s )I n ,-,w,oVo(n s )Vj(n T )\0} R 

(45) 

The denominator is the soft factor, but with the nj Wil- 
son line changed to have direction n s . The rapidity argu- 
ment, y p , is the exact rapidity of the target proton. (This 



should be distinguished from y^ which parameterizes the 
direction of the target associated Wilson line and points 
approximately in the target direction.) Notice that this 
definition is given in coordinate space. When we Fourier 
transform to momentum space, the division will be in the 
sense of a convolution product. 

We understand the meaning of the definitions in 
Eq. (43]) and Eq. (45]) as follows: 

• The PCF for the soft factor treats central gluons 
accurately, the tit and nj Wilson lines providing 
accurate approximations to the quark lines and as- 
sociated collinear subgraphs. 

• As gluons approach the target rapidity, the accu- 
racy of the nx Wilson line as an approximation for 
the incoming quark line is degraded in both the 
target and soft PCF. 



• The numerator in Eq. (|45| provides a good approx- 
imation for gluons in the target range of rapidity, 
and the denominator accurately cancels the bad ap- 
proximation for this same region in the soft factor. 



• In the numerator of Eq. 145 j) . central gluons, 
around the n s direction, are accurately given by 
an eikonal approximation of the form also appear- 
ing in the denominator, so we get a cancellation. 
Gluons of intermediate rapidity also cancel. 

• The non-lightlike Wilson lines in direction n s pro- 
vide strong cut offs in Eq. (|45p on gluon rapidities 
beyond the central region. 

• Similar ideas apply to negative rapidities and the 
correspondingly defined jet factor (next section). 

The lack of question marks on the equal sign in Eq. (|45p 
implies that this definition is the main definition. It has 
acquired a second rapidity argument, which is quite unde- 
sirable, but this does correspond to a definition by Idilbi, 
Ji, Ma and Yuan [3?], [H| for unintegrated PDFs in the 
context of SIDIS. However, various further reorganiza- 
tions, together with an application of the CS evolution 
equation can handle the dependence on the extra argu- 
ments. 

However, to match the derivation of factorization, an 
excellent choice for j/t is to be close to the target rapidity. 
In fact the simplest choice is to set ?/t = yp- In that 
case, the important dependence for which we need the 
CS equation is on y s . 



3. Jet PCF 

We define a jet PCF, to account for final-state-collinear 
behavior, just like the target PCF, in versions without 
the denominator: 
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J(w ) jetdirection ) y J) y s ,fi) = (0\tp(w)V^(-n s )I_ ns . wfi j V (-n s )ip(Q)\0) R) (46) 

and with the denominator: 

f f , • +A - ^ , (Omw)V^-n s )I_ nsVW ^-V (-n s )m\0)R fA7 , 

J mo d w, letmrection, yi,y s , a) = j j j . (47) 

(0|il n .. tBf0 K,(-n.)V 1 t(nj)/ nj . t0)0 Vo(n J )V t(-n a )|0) Jl 



pie, the momentum space PCF in the target is 



/ ,-''(0,0, O t ) 



(w + ,w , w t ) 



I(-n s ) 



FIG. 10: Path of the Wilson line in the definition of the frag- 
mentation function. 



The paths of the Wilson lines in this case are illustrated 
in Fig. [TOl The arguments of the jet PDF will include a 
momentum variable, analogous to the proton momentum 
in the target PCF, for the exact rapidity of the outgoing 
jet. We use a script J as the symbol for the jet PCF 
rather than J in order to distinguish the jet PCF from 
the jet bubble that appears in graphs like Fig. [1] 

A surprising feature is that in Eq. (I47p we use —n s for 
the central-rapidity Wilson lines, which is the opposite of 
what we used in Eq. (|45f) . This arises from a correspon- 
dence with different Wilson lines in the soft factor. For 
the target PCF, the Wilson line with direction n s is at 
negative rapidity relative to the target. In the rest frame 
of the target n s has a large minus component, and the 
Wilson line is associated with the outgoing k + q quark. 
However, «x is the nearly light-like vector that charac- 
terizes the momentum of the outgoing quark. Therefore, 
the minus component should be of the same sign as nj, 
i.e., positive, which is just what we have defined it to be. 
In contrast, in the jet PCF, Eq. (|4T|) . the correspond- 
ing Wilson line is to have the sign of its plus component 
match that of the target associated Wilson line, i.e., of 
riT- Thus we use —n s not +n s , given that we use a 
space-like Wilson line. 



D. Connection to factorization 



The momentum-space PCFs are determined by the 
Fourier transforms of the above definitions. For exam- 



F mod (k,y pi y T ,y s ,fi) 

dw + dw~d 2 w t 
32^ e 



-ik-- 



'F mod (w,y p ,yT,y s ,ii). (48) 



When we come to the issue of factorization it will be 
important to recall that the momentum space PCFs dis- 
cussed in this section are well-defined for all values of mo- 
mentum, even those that lie far outside the range that is 
meant to be accurately described by the PCF. For ex- 
ample, the momentum-space soft factor S(l, j/t, /-t), 
obtained by Fourier transforming Eq. (|43p . exists even 
for values of I that are far from soft. Of course, when 
the PCFs appear in the factorization formula for physi- 
cal processes, they should be large only for appropriate 
values of momentum. This is partly accomplished by the 
use of non-light-like Wilson lines in the definitions above, 
which, as we shall see in the next two sections, cut-off 
the contribution from light-like gluons. The motivation 
for writing down Eqs. (|43|) . (|45| . and (|47| will become 
clearer in the next few sections where we will show ex- 
plicitly what is needed for a factorization formula. This 
new factorization formula will be defined with exact over- 
all kinematics that takes into account soft and collinear 
gluon emissions. We will show how the definitions for 
parton correlation functions listed above follow naturally 
from the factorization of soft and collinear gluons in DIS. 



VI. SUBTRACTIONS 

The overall approach we use is a subtractive approach 
— e.g., [39l | — generalized from the Bogoliubov approach 
to renormalization. Up to power-suppressed terms, each 
graph r is written as a sum over a contribution for each 
of its leading regions: 

T = CijT + power-suppressed . (49) 

r of r 

A single graph typically has many different regions, each 
corresponding to a different graphical decomposition of 
the form of Fig. [TJc). As explained below the definitions 
of the terms CrT employ approximations and then sub- 
tractions to eliminate double counting between regions. 
We denote the chosen approximation corresponding to 
a particular region by the action of an "approximator" , 
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Tr, as in Eq. (|32|) for the simple case of parton model 
kinematics. 

The different regions can be ranked according to their 
sizes, e.g. a soft region corresponds to a smaller range 
of momentum than a collinear region, and is therefore a 
smaller region in a set-theoretic sense. We define a region 
as minimal if there are no smaller regions. The contri- 
bution from a minimal region i?o is simply the action of 
the corresponding approximator on the unapproximated 
graph, 



C_R o r = T Ro T. 



(50) 



For the contributions from larger regions, we avoid 
double counting by performing subtractions for the con- 
tributions from smaller regions. So we define 



c r t = t r (t- J2 c r ,y 



(51) 



R'<R. 



For a minimal region, Eq. (|51|) reduces to Eq. (|50| . so 
that it gives a valid recursive definition of C R T with the 
terms being constructed sequentially starting from the 
minimal region(s). 

As exhibited in Fig. [TJc), a complication is that the 
graphical representation of the regions does not directly 
correspond to factorization, because of multiple gluon 
connections between the different factors. This contrasts 
with the case of Fig. [T{b), where we have topological fac- 
torization. Therefore an important constraint on choos- 
ing the definition of T R out of the range of possibilities is 
that (if possible) the graphical factorization results in an 
actual factorization after a sum over graphs and regions: 



CrT = factorized form . 



(52) 



VII. 



KINEMATIC APPROXIMATIONS AND 
RAPIDITY DIFFERENCES 



The kinematic approximations that enable factoriza- 
tion to be derived utilize certain properties of Minkowski- 
space momenta. We now review them with a view to 
systematizing our later work. 

Corresponding approximations in a Euclidean space 
are much more trivial. Thus if p and q are two spa- 
tial momenta with |p| <C |q|, then we can approximate 
(P + q) 2 by q 2 , up to a power-law correction. This is 
simply because angles are bounded in a Euclidean space. 

In Minkowski space, we have to deal with unbounded 
rapidity variables instead of angles. (Rapidity is useful 
to us because our process has a preferred axis.) Sup- 
pose we have two 4-momenta k± and k 2 with rapidities 
defined by yi — ^ In \kf /k^\, and such that \k^k^\ is 
comparable to or bigger than kf t , as would be the case 
for an on-shell momentum. These could be, for example, 
the four-momenta of two internal gluon lines in a graph, 



T. We write the orders of magnitude of the (+,— ,T) 
components of each momentum as 



(fc+ , fcr , k itt ) = O (Me Vi , Me^' , M) , 



(53) 



where M is an appropriate mass scale. 

The interesting case will be where the rapidities are 
quite different, let us say e Vl ~ V2 ^> 1. Then ki ■ k 2 is 
dominated by one term: 



fci • &2 = k\k 2 



l + 0(e-( yi ^ 2 >) 



(54) 



We will be able to apply this in Fig. [ljc), for the nu- 
merators of the attachments of the gluons from B to the 
collinear subgraphs J and <&, and for the attachments 
of the gluons from the collinear subgraphs to the hard 
subgraphs. 

There is an interesting region, where Eq. fails 
because one momentum variable has particularly small 
longitudinal momentum components, i.e., |/c^~fc~| <C kf t . 
This is called the Glauber region, and it is a natural 
case to examine since it corresponds to a virtual parti- 
cle exchanged in small-angle elastic scattering, as in a 
final-state interaction. An important part of factoriza- 
tion proofs is to arrange for a contour deformation to get 
out of the Glauber region, when possible. See Collins and 
Metz [35[ for a recent treatment of issues relevant to our 
discussion in later sections; for the DIS reactions treated 
in this paper, a contour deformation out of the Glauber 
region is possible. 

It is worth observing that the word "region" has two 
slightly different connotations in our discussions. One 
refers to a locality in the space of 4-momenta, as in the 
explanation of the Glauber region in the previous para- 
graph. The other connotation is as a locality in the multi- 
dimensional space of loop or line momenta for a whole 
graph, as in Eq. (|51|) . For a graph with a single gluon ex- 
change, we often use the direct correspondence between 
the regions of the graph and the regions for the gluon's 
momentum. 

A propagator denominator 



k 2 ) 2 -m\ 



(55) 



needs a bit more care than a simple product k\-k 2 , since 
the appropriate approximation depends also on the rel- 
ative virtualities of k\ and k 2 . We again assume that 

we have deformed out of any Glauber region, and that 
e yi-V2 > i Then- 

1. If the virtualities of k\ and k 2 are comparable to 
each other, and both are comparable to or bigger 
than m 2 , then the denominator is dominated by k^ 



and k 



(h + k 2 f - to 2 ~ 2kt k 2 



(56) 



An elementary application is to the virtual photon 
in Fig. []Ja) where Q 2 ~ 2k + l~ when the initial 
quark is collinear to the target. 
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target-collinear and "J" for jet-collinear. The solid lines in- 
FIG. 11: A graph that yields a contribution to the soft and dicate the positions of the actual pinch singularities, and the 
collinear regions. dots indicate their extrapolations to where there is no pinch 

at the collinear singularities. 



2. But if we also have to treat the case that one mo- 
mentum, &2 say, has virtuality much less than that 
of the other, then although we can neglect k\ with 
respect to k^k^ , we cannot necessarily neglect k\ 
or m 2 . Thus we can only write 

(fci + k 2 ) 2 - m 2 ~ k\ - m 2 + 2k+k^. (57) 

This amounts to replacing hi by its minus compo- 
nent. This approximation will be the primary tool 
in deriving factorization. 

VIII. REAL GLUON EMISSION 

In this section we examine the simplest case of the 
gluonic corrections to the parton model that were sum- 
marized in Fig. QJc). This is given by the emission of 
one real dressed gluon [5(3] — Fig. [TTJ We will consider 
virtual gluon radiation in a later section. 

A. Regions for gluon exchange 

The gluon, of momentum l 2 , evolves into a final state 
represented by the bubble, Jg.xpih)- It attaches to a 
jet-associated bubble denoted by J {k + q,l%), and a 
target-associated bubble i> p (fc, I2, P). Here A and p are 
Lorentz indices, and the subscript, g indicates that this 
is the bubble associated with an outgoing gluon. 

We will restrict attention to the case that there is no 
production of extra jets of high transverse momentum. 
Therefore, by standard power-counting arguments, we 
only need to consider the contribution from three regions 
of the gluon's momentum: soft, target-collinear, and jet- 
collinear. These are fundamentally distinguished by the 
rapidity of the gluon relative to the target and the jet — 
Fig. [71 Each of these three regions corresponds to a par- 
ticular decomposition of the form of Fig.QJc), and for the 
moment we assume that no other regions are relevant. 

A complication is that there are two ways to character- 
ize the regions. One is by the Libby-Sterman [31| analysis 
in terms of the pinch-singularity surfaces (PSSs) of the 



corresponding massless graphs. The other is in terms 
of the very different rapidity ranges of, in this case, the 
exchanged gluon. The analysis in terms of rapidity is 
closely related to the use of angular ordering in leading 
logarithm approximations. 

For the Libby-Sterman analysis, the massless singular- 
ities, illustrated in Fig. [T^l are: A soft gluon singularity 
at zero gluon momentum 1% = (0, 0, t ), a target-collinear 
singularity at = ( z tP + , 0, t ), and a jet-collinear sin- 
gularity at I2 = (0, zjq~, Ot). Here, zt and zj param- 
eterize the position along the collinear-singularity lines 
in a frame-independent fashion. The momenta that are 
actually relevant are in neighborhoods that surround the 
PSSs in Fig. [121 Therefore, we need to introduce vari- 
ables that allow us to specify the proximity of the mo- 
menta to a PSS. For each PSS, we parameterize its neigh- 
boring momenta by what we will term radial and angular 
coordinates. Thus we write: 

• For the soft region 

l 2 = \(l+, 12,%). (58) 

The dimensionless "angular" variables 1% obey 
some moderately arbitrary normalization condi- 
tion, e.g., yN/ffl = 1. These variables parame- 
terize a surface of constant A surrounding the soft 
point 1 = 0. Then I2 2 is of order A 2 , with a coeffi- 
cient bounded away from zero and infinity. This is 
the property that enables us to estimate the errors 
on our approximations systematically. 

A formal implementation of the quasi-angular inte- 
gration can be made as follows: 

J d% ...= j d\ A 3 J d% ■ ■ ■ 

ee J d\ A 3 J d %^8(\- Jy^%?) (59) 

where we use the normalization condition proposed 
above. This formula is written in a form equally 



20 



suited for virtual gluon exchange. Any constraints 
on the invariant mass of the gluon from the nature 
of the final state are taken to be in the integrand, 
the part indicated by ". . .". 

The formula obviously does not give a Lorentz- 
invariant decomposition of the integration, but is 
adapted to the needs of the process. It is arranged 
so that power-counting in A is straightforward: The 
angular integral phase space J (PI2 is independent 
of A. Moreover, it is unambiguous that a small 
value for A corresponds to a small neighborhood of 
the origin in Fig. [T3J This would not be true if we 
had tried to characterize the soft region by specify- 
ing it in terms of a Lorentz invariant quantity, i.e., 

• For the target-collinear region 

h= f^ P ^£|f> A ' (60) 

Our choice for the normalization condition of the 
dimensionless angular variables is + % t = 1- 
The asymmetric scaling is suitable for a momentum 
highly boosted from a rest frame, and again ensures 
that I2 2 is proportional to A 2 . 

A formal definition of the quasi-angular integration 
is: 




In the Libby-Sterman terminology, zt is an "in- 
trinsic" variable for the PSS, parameterizing the 
position on the surface. Then A can be treated as 
measuring the distance from the surface, while I2 
parameterizes a (2-dimensional) surface around the 
PSS, for a given value of the intrinsic variable. 

• For the jet-collinear region 

h= \ —p,zjq-,\l%\, (62) 

exactly similarly to the the situation for the target- 
collinear region. 

For any of these cases, when the four-momentum has a 
virtuality of order a typical hadronic scale A 2 , then the 
radial variable A is itself of order A, and this can be 
regarded as the canonical size of I2 for the region. This 
gives a basic intuition about the meaning of collinear and 
soft momenta. (Here we temporarily assume that the 
zt 1 j variables are of order unity.) But we integrate over 



all accessible momenta (or up to some limit of order Q), 
so it is important to treat A as ranging from to order 
Q. 

The approximations we use will have typical fractional 
errors suppressed by a power of the various small mass 
scales, e.g., A/Q, A/Q, m/Q. As we move to larger A 
than the "canonical" value, the errors become larger. But 
at the same time, as we will see, the approximations for 
larger regions become useful, and the overall effect in 
the subtraction formalism will be that the total error in 
the sum of all the approximated contributions will be of 
higher twist. That is, the fractional error will be a power 
of a fixed hadronic mass scale divided by a large scale 
like Q. (Logarithmic corrections will slightly weaken an 
initially determined power-law suppression.) 

We can now compare the Libby-Sterman analysis and 
the rapidity analysis. With the above definitions, there 
is a large rapidity difference between the two collinear 
regions and this will be sufficient for us to obtain ap- 
propriate approximations. The Libby-Sterman analysis 
further requires zj and zt to be of order unity, so that 
the large component of a collinear momentum is of order 
the hard scale Q. While this is important for discussing 
hard scattering, it complicates the treatment of the im- 
portant subcase where one of these z variables goes to 
zero. In that case when A for a collinear region is suffi- 
ciently small (of order a mass times z), the gluon is then 
simultaneously collinear, by the rapidity criterion, and 
soft by the criterion of small size. But the gluon is not 
collinear by the Libby-Sterman criterion of energy being 
of order Q. This case is only significant for a massless 
gluon. To avoid a proliferation of special cases, we unify 
this case as much as possible with the collinear case. For 
our initial all-orders treatment in the present paper, we 
will cut off this region by the use of a gluon mass, as 
announced in the introduction. But we will not need to 
do this just yet. 

Related issues have arisen in the literature in the form 
of a distinction between a soft and a supersoft region [Z(| • 
In the language of this section, they are distinguished by 
the numerical values of A. When the components of I2 are 
of order A we are in the conventional soft region; when 
they are of order A/Q 2 we are in the supersoft region. 
(This will be discussed further in Sect. IVIIIDh Again 
we will unify as much as possible the soft and supersoft 
regions. 

B. Unapproximated graph 

Before any approximations, the formula for Fig. [11] is 

r («) = fl^V f d% f d*k 
4tt J (27r) 4 7 (2tt) 4 

x Tr[-f u j K (k + qJ 2 h^ p {k,l2,P)] Jg; K p(h). (63) 

Here the superscript ( R > denotes graphs for emission of 
a real dressed gluon, and, as before, P M „ represents a 
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projection for a particular chosen structure function. We 
use bars on $(fci, l 2 ) and J{ki,l 2 ) to distinguish bubbles 
with an extra external gluon from those in Fig. [ljb) . The 
indices n and p on J and $ are now for the coupling to 
the gluon, not for a decomposition over Dirac matrices. 
The final-state bubble for the gluon in Fig. QT] is denoted 
by Jg-Kp- In the approach with standard kinematical ap- 
proximations this gluon would be put on-shell, and there- 
fore we would have J g - Kp oc J2 po \ e ^ e p^{%)^ where e is a 
gluon polarization vector. Since we keep exact kinemat- 
ics we do not perform these approximations. Finally, we 
have left implicit the quark-flavor label j on these quan- 
tities. 

Our strategy is as follows: We start with the smallest 
region, the soft region, and construct an approximator 
Tg that is (a) accurate in the soft region, and (b) suit- 
able for the use of a Ward identity argument to bring 
the total soft contribution, ^pCsT^, into a factorized 
form. The subtraction method requires us to extend the 
integration in the soft term to larger regions. Then we 
follow similar steps to construct approximators Tj and 
Xt for the collinear regions. Again, these have to be 
compatible with Ward identity arguments. Application 
of the methods of Sec. I VII will provide subtractions that 
compensate double counting between the terms for differ- 
ent regions, so as to ensure that the sum of these terms, 

gives an accurate approxima- 
tion for the union of the regions, including all intermedi- 
ate cases. This therefore deals with all regions involving 
low transverse momentum for 1%, i.e., for l 2 . t <C Q, with 
relative errors being approximately of order l 2i t/Q- 



C. Soft Region 

We now define our approximation for the soft region. 
The method is that of Grammer and Yennie [4l|, as ap- 
plied in factorization proofs (e.g., [j|). 



In the Breit frame, the target is boosted to have a 
large plus component of momentum, of order Q, while 
the outgoing jet is boosted to have a large minus com- 
ponent of momentum, also of order Q. Therefore, for 
the coupling of the jet and target bubbles, J K and $ p , 
to the exchanged gluon, we may characterize the sizes 
of the vector components by their transformations un- 
der boosts. The largest components have n = — and 
p = + respectively. Relative to the largest components, 
the smaller components have sizes 



J- \Q 2 Q 
$+ \Q 2 Q 



J- \Q Q 
$+ \Q Q 



(64) 
(65) 



These power laws result both from the size of the compo- 
nents of the momentum l 2 of the exchanged gluons and 
from the sizes of the components of the collinear mo- 
menta, which are boosted from their rest frame. For the 
moment, we treat the collinear momenta as having trans- 
verse momenta of order a normal hadronic mass scale A. 
At first sight, the Lorentz boosts to get collinear mo- 
menta indicates that the non-leading longitudinal com- 
ponents, J + and $~ for the collinear subgraph, would 
be of order A 2 /Q 2 relative to the large components. This 
would in fact be correct if the minus component of the in- 
jected soft momentum l 2 were sufficiently small, i.e., if A 
were less than about A 2 /Q. But when it has a larger 
value, e.g., the "natural" value for a soft momentum 



In 



A, some lines of acquire this size for their mi- 



nus momentum. Correspondingly <!> /<I> + increases to a 
size X/Q. 

To leading power, (in either X/Q or A/Q) we need 
to keep only the leading polarization components, and 
we make the following string of approximations to the 
product of bubbles in Eq. 



J K (k + q,l 2 ) J g;Kp (l 2 ) $ p (k,l 2 ,P)~J-(k + q,l 2 ) J+-(l 2 ) $ + (k,l 2 ,P) 

~J(k + q,l 2 )-l 2 4 p k-Hk,l 2 ,P). (66) 

'2 '2 

Here, our aim is to obtain a form in which the gluon momentum l 2 is contracted with each jet factor, a situation in 
which we can apply a Ward identity. The critical step is in the last line, where we use the fact that in the soft region 
l 2 ■ $ ~ l 2 § + and J ■ l 2 — J~l 2 ■ This step requires that the longitudinal components of l 2 be comparable to each 
other, which in turn requires that the rapidity of l 2 be small. It also requires that l 2 be outside the Glauber region, 
as is always true for real gluon emission. (Recall that the Glauber region is where \l 2 l 2 | <C l\t ) 

Within the soft region, the relative error in the approximation is then of order X/Q. However, in the factorization 
formula, we will integrate over the whole accessible range of l 2 . This will of course take us outside the soft region 
where the above approximations are accurate. By itself this is no problem, since such a contribution will eventually 
be accommodated by proper double-counting subtractions in the treatment of other regions. But, particularly when 
we apply the same soft approximation to virtual gluons, the denominators and will create rapidity divergences 
that are completely unphysical. The simplest solution is to replace these denominators by dot products with nj and 
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n-r, the vectors defined in Eq. (f4"Tj) : 

n K T (U\n p 

J K (k + q, h) J g -, Kp (h) $ p (fc, h,P) =s J(A + g, / 2 ) • I 2 Tl J 9 :Tn , H ^ • *(*, «a, i>) ■ (67) 

(«2 • nj — «e) (t2 • jit + «ej 

I 



soft region 




VJ - yT 

2/Q 



FIG. 13: Distribution of gluon rapidity. The solid line is 
the exact distribution, whereas the dotted line represents the 
distribution obtained from the soft approximator, Ts. 



? q 




q + k = 1 



FIG. 14: Corrections to current vertex, and kinematic pro- 
jection for hard scattering. 

In the soft region, I2 ■ nj — l£ , and I2 ■ ht ~ I2 , so that 
the accuracy of the approximation is unimpaired. Be- 
yond the soft region, these replacements provide cutoffs 
in an integral over the rapidity of £2- We also make a 
corresponding change in the J g part of the numerator, so 
that after we apply a Ward identity to sum over all at- 
tachments of the gluon to $ and J ', we obtain exactly a 
term where the gluon attaches to a Wilson line operator. 
Finally, we introduce the ie prescriptions appropriate to 
the directions of the Wilson lines determined in [HJ . 
Let us choose the rapidities j/t and yj that define the 



vectors n.x and nj to match the target and jet rapidities. 
Then there is a natural correspondence between the ra- 
pidity cutoff provided in the soft approximation and that 
provided by the <& and J subgraphs before approxima- 
tion. This is illustrated in Fig. [1~3"1 

One further refinement in the exact definition of the 
approximation is needed to ensure that it works suitably 
when there are higher-order hard corrections at the elec- 
tromagnetic vertex, the left-hand-side of Fig. Q31 The 
momentum I2 flows through the vertex, so that the hard 
factor can vary with I2. In the soft region (of the I2 inte- 
gral) this is an unimportant power-suppressed effect, but 
in the complete integral over all I2 it can create a big ef- 
fect. When multiple soft gluons are exchanged, we will be 
likely to find that the definition of the soft factor needs to 
be changed from Eq. (|43|) in an inconvenient way: instead 
of Wilson lines joining at two point vertices at and w, 
we will have nonlocal vertices, with the nonlocality gov- 
erned by detailed properties of the hard scattering. To 
avoid this issue unambiguously, we define the hard scat- 
tering subgraph to be evaluated at suitably projected mo- 
menta that stay fixed as I2 varies. This is readily done by 
requiring from the beginning that the external momenta 
of the hard subgraph always be projected down to the 
parton model values, as in Fig. Q3] There we use the mo- 
menta k = (—<?+, 0,0 t ) and I = q + k = (0, q~, t ) that 
we defined earlier. Thus the incoming and outgoing mo- 
menta of the approximated vertex are exactly massless, 
on-shell and independent of / 2 - Naturally, we also need 
to apply the projections Vt and Vj in the Dirac algebra, 
exactly as in the basic parton model. 

Putting these elements together gives the definition of 
the approximator for the soft region for Eq. (|63[) : 



{2*y J (2^) 4 v.-. n (| 2 ■ nj - ie) (J 2 ■ nr + ie) 

(68) 

Now apart from the explicit gluon line, the only off-shell external lines of $ and J are the quark lines at the photon 
vertex. So an application of a Ward identity to the contraction of I2 with these factors, summed over graphs, takes 
the gluon line outside of $ and J , to give 



T B r<*> = ^ / 4% / ^* Tr [VrfV, J(k + q -l 2 ) *(*, P)] ^^^p^K (69) 

47T J (2-7r) 4 J (27r) 4 (I2 ■ nj - it) (I2 ■ n T + ie) 



Notice how the bubbles are replaced by those that were used in the parton model, in Eq. (|24p . but with the momentum 
in J shifted by — Z 2 . Also, there is an overall minus sign, and a factor Cp that arises from the color matrices on the 
quark lines. As for the Ward identity, its derivation is well-known, for the case of the model Abelian gauge theory 
that we use at the moment — see e.g., [42|, p. 339]. We review the derivation in App. [B]in the context of graphs such 
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FIG. 15: Factorization of one soft gluon. 

as Fig. QT]and Eq. Complications arise in the non-Abelian case which does not yet have a complete treatment. 

We postpone the issue of the corresponding derivation in a non-Abelian theory (i.e., QCD) to later work. 
Finally we rewrite this result using the notation we used for the case of the pure parton model: 

X> r(R) = ^/ H / (27r)4 ^ +g _) \H ( q , k)\ 2 P)J~(k + q - h) (h,m, y 3 ). (70) 



which we notate diagrammatically in Fig. 1151 The indices 
"+" on $ + and "— " on J~ now denote, not Lorentz 
indices for a gluon, but the same projections concerning 
the leading part of the Dirac matrix structure that we 
used in the parton model in Sec. IIVI We define the one- 
loop rcal-gluon contribution to the soft factor as 



S^(/ 2 ,y T ,yj) 



-g 2 C F n K r J g . K p(l 2 )n? r 
(h ■ n 3 — ie) (l 2 ■ n T + ie) ' 



(71) 



and as indicated in Fig. [TSJ this is obtained from the 
Feynman rules from our definition of the soft factor in 
Eq. (|43|) . The lowest-order hard-scattering factor squared 

.Ho is given by Eq. (f35|) . so it is exactly the same as 
we found in the parton model approximation. This is 
necessary if factorization is to hold; the hard scattering 
does not depend on how many gluons attach to Wilson 
lines in the soft factor or on their momenta. 



D. Accuracy and limits of soft approximation 

After application of a Ward identity, we get a simple 
soft factor in Fig. [T5l with one graph for the ^-dependent 
soft factor. However, depending on the precise size and 
rapidity of 1%, different kinds of graph will dominate in 
Fig. QTJ before the Ward identities are used. This leads 
to complications if one wishes to set up factorization 
by considering individual graphs with on-shcll final state 
partons; as we will now show, different types of graphs 
dominate depending on how soft the radiated gluon is. 
Therefore, the unified treatment of the soft region dis- 
cussed in the previous subsection, with all graphs implic- 



itly included in the final state bubbles, has the notable 
advantage of dealing with all types of soft gluon behavior 
at once. 

We illustrate this by examining the graphs in Fig. [T|)] 
in a model field theory. For this we use an Abelian gauge 
theory supplemented by a color-singlet scalar field, which 
we treat as describing the model's hadrons, and which we 
use for the measured initial- and final-state particles. By 
using an Abelian gauge theory, we are allowed a gluon 
mass m gi which can be zero or nonzero; we use this 
to conveniently illustrate the issues associated with the 
mass of the gluon. 

First, without gluons we have graph (a), which gives 
a parton-model description of DIS with a parton den- 
sity and a fragmentation function to describe an observed 
hadron in the jet — the target splits into a quark and an 
antiquark, each with a large plus component of momen- 
tum, and the incident virtual photon scatters from the 
quark. 

Then we add one extra emitted real soft gluon. To keep 
the discussion simple, we restrict our attention to the sit- 
uation where the rest of the final state is near the parton 
model region. That is, in the final state, the transverse 
momenta (in the Breit frame) are limited to at most or- 
der A, and the hadron and quark fields have masses of 
order A. The struck quark has a plus momentum of order 
Q, and minus and transverse components less than or of 
order A. Later we will weaken these conditions. 

There are graphs in which the gluon attaches only to 
the target or jet parts of the graph; these simply provide 
corrections to the parton density and to the fragmenta- 
tion. There remain 4 graphs of interest, (b) to (e) in 
Fig. EH (plus their Hermitian conjugates). 
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(d) (e) 

FIG. 16: Illustrating the different detailed leading regions for an exchanged gluon in a spectator model. The double lines are 
for a color-singlet scalar field treated as an analog of a hadron. The dashed arrows in graphs (b) and (c) indicate the lines that 
are pushed far off-shell by the inclusion of the extra gluon. 



We first treat the case that the extra emitted gluon has 
small rapidity and has transverse momentum of order A. 
Then it pushes some of the quark lines off-shell. For 
example, in graph (b), these requirements and the other 



requirements listed above on the spectator antiquark and 
the outgoing hadron-quark system imply that (k+q) 2 and 
(k — I2) 2 are of order AQ: 



(q + k) 2 
1 



(q + k- l 2 f - q 

1 



2l 2 -{k + q) AQ' 

1 



(k - I2) 2 -m 2 k 2 - m 2 +q-2l 2 -k AQ' 



(72) 
(73) 



Here we have used the fact that (k + q)~ ~ Q and k + ~ 
Q. Although in the upper part of the graph we have 
routed I2 towards the final state on the quark line of 
momentum g+A; — Z2, it is the line q+k that goes off-shell. 
The two lines listed above are the only such far off-shell 
propagators in graph (b), so that the two 1/Q factors 
are compensated by a factor of Q 2 from the numerator. 
Thus there is no suppression of graph (b) 

In contrast, there are at least three far off-shell propa- 



gators in graphs (c) through (e). Therefore, graph (b) 
dominates, and the others are power suppressed. In 
graph (c), the lines that are far off-shell are indicated 
with dashed arrows. 

As long as the mass of the gluon is of order A, and the 
transverse momenta in the target and in the jet system 
are also of this magnitude then we can regard graph (b) 
as the only important graph at this order and we are 
finished. 
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But if we break with the requirements listed above 
by allowing the gluon mass to approach zero, then the 
transverse momentum of the gluon can go much smaller 
while keeping leading power contributions in graphs (c) 
through (e). Alternatively, we could raise the transverse 
momentum in the target and/or jet systems, for exam- 
ple, to order y/AQ, while still preserving the essential 
collinearity. In these situations the relative importance 
of graphs (b) through (e) in Fig. [TBI changes. 

Let us use a zero gluon mass and continue to keep its 
rapidity central. When its transverse momentum is re- 
duced to about A 2 /Q, there is no longer a penalty from 
off-shell propagators: all the denominators are of order 
A 2 . When the gluon's transverse momentum is reduced 
much further, graph (b) becomes unimportant, because 
the decreasing denominators in l|72p plateau at order A 2 . 
For such supersoft gluons to contribute we must have 
denominators that continue to decrease. This happens 
only for graph (e), where the gluon attaches directly to 
final-state colored lines. This is in fact just the ordinary 
IR divergence that appears when a massless gauge boson 
is emitted from an outgoing on-shell fermion. In a suit- 
able sum over final states, the ordinary IR divergences 
from vanishing transverse momentum cancel against the 
graphs for virtual gluon emission. 

All of these cases are covered by our general Ward 
identity argument. We just let a soft gluon couple in all 
possible ways to the collinear subgraphs. Some of these 
are smaller than others, but that does not matter. 

These results can be generalized to include also 
collinear gluons, i.e., of large rapidity. This gives Fig. 
U7f iL where we plot the regions in gluon transverse mo- 
mentum and rapidity where the different graphs domi- 
nate. The figure is labeled by letters that indicate which 
of the single gluon radiation graphs in Fig. [TBI dominate 
in different kinematical regimes. Naturally, on a bound- 
ary between the regions the graphs associated with both 
sides of the boundary are important. The soft region cor- 
responds to the part of the graph near the vertical axis, 
the target-collinear region to the part at positive rapid- 
ity, and the jet-collinear region to the part at negative 
rapidities. 

We now describe Fig. \T7\ i) in more detail. We start 
by describing the top diamond-shaped area where graph 
(b) gives an important contribution. For a massless real 
gluon in the final state, we may write its momentum as 



h = 



\ht[ 
V2 



\h 



V2' 



h 



(74) 



Then the condition that if /Q < 1 gives one bound on 
the rapidity for the area where (b) contributes: y < 
\n(Q/lJ). Likewise, the condition I2/Q gives a bound 
y > — ln(Q//J). These bounds give the top two diagonal 
lines for the boundary of the graph- (b)-dominant area in 
l :ii. [TTtii. 

Furthermore, the propagator q + k — l 2 should have 
virtuality at least of order A 2 , otherwise we gain by going 
to graph (d) with the gluon attaching directly to the 



final-state quark. This gives a bound y > — h^Q^t/A 2 ), 
the lower left edge of the diamond. The corresponding 
bound, 



< 



ln(QZ 2 ,t/A 2 ), on the target side gives the 
remaining side of the diamond. When I2 strongly violates 
any of these bounds, there is a power-law suppression 
relative to the largest contribution. 

Next we obtain the triangular area where graph (c) 
becomes important. For graph (c), the upper end of the 
gluon line still connects to an internal quark line, so the 
bound y > — \n(Ql 2 .t/ A 2 ) also applies to graph (c)'s area, 
i.e., the continuation of the lower left boundary of the 
region for graph (b). But on the target side, we need 
V ^ m ((%,t/A 2 ), to avoid gaining two far off-shell prop- 
agators. In addition, there is a suppression of the contri- 
bution whenever y is bigger than the target rapidity yx, 
since then the previous dominance of the 2^2~(-P + — k + ) 
term in the P — k + I2 propagator is cutoff by the term 
2I2 (-P - — k~). This gives the right-hand vertical line 
bounding the area for graph (c). 

Exactly similar considerations give the area where 
graph (d) is important. 

The bounds relative to target and jet rapidity, yj < 
y VTi actually apply to all the graphs, but are au- 
tomatically implied by the other bounds except for two 
cases: the target rapidity bound for graph (c) and the jet 
rapidity bound for graph (d). 

Finally, in graph (e), the gluon connects to both of 
the final-state quarks. Whenever l 2 goes far outside the 
bounds given by the lowest two diagonal lines in Fig. 
fTTH ). pairs of lines on the target or jet side are made off- 
shell by much more than A 2 , and we get a suppression. 
In addition, the rapidity is restricted to be between the 
jet and target rapidities. Therefore, graph (e) dominates 
in the area beneath the wedge shape at the bottom of 
Fig.Efti). 

Notice that if the gluon had a mass of order A, there 
would be a suppression of the region with fc t much less 
than A. Then graph (b) would dominate in the soft re- 
gion, and graphs (c) and (d) would only contribute in the 
target- and jet-collinear regions, while graph (e) would 
always be power suppressed. This shows some of the es- 
sential complications caused by the masslessness of the 
gluon in Feynman graph calculations, even though the 
regions of supersoft gluon momenta are presumably cut 
off by non-perturbative confinement in real QCD. 

Our definition of the soft approximation, adapted [43| 
from the Grammer-Yennie [4l| paper, is arranged to 
avoid any need to discuss the details of which graphs 
are important in different ranges of soft if ■ The only ap- 
proximations are made in the numerator coupling each 
jet factor to the soft gluon l 2 - These approximations 
become 100% inaccurate when either the energy of the 
gluon is of order the jet energy or when the rapidity of 
the gluon is comparable to the jet rapidity. For the cou- 
pling to the target, the magnitude of the fractional error 
is then the maximum of if /Q, A/Q, and e~( yT ~ v \ where 
j/t is the target rapidity. For the coupling to the jet, the 
fractional error is similarly the maximum of l 2 /Q, A/Q, 
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(e) 

(i) (ii) 

FIG. 17: (i) Ranges of gluon transverse and rapidity where the different graphs in Fig. [16] dominate. We use a logarithmic 
scale for 1% ■ The upper diagonal lines correspond to where the energy of the gluon in the Breit frame is of order Q, and the 
vertical lines correspond to the rapidities of the jet and the target, (ii) The case that the gluon attaches to the target remnant 
and that XBj is close to unity. 



and e~( y ~ VJ \ (The calculation is to be done in the Breit 
frame, and we have assumed XBj is not close to unity, 
otherwise the limits are decreased for accuracy of the 
coupling to $.) 

These error estimates are arranged in a form that is 
equally suitable for the case that I2 is collinear to the jet 
or target. For the soft approximation, we need to apply 
all the error estimates simultaneously and then we find its 
fractional error is the maximum of X/Q, A/Q, e~( yT ~ v \ 
and e" fe ~ ai) , where A is defined by Eq. ([58]) . That is the 
soft approximation is valid when A « (J, e VT <C e v <C 
e yj , and of course when A and the transverse momentum 
scale of the jets is much less than the hard scattering 
scale Q. Thus the gluon has to have low momentum and 
have central rapidity. 

Observe that a complication ensues when xej gets close 
to unity. For small or moderate xbj, we expect the out- 
going jet to have large positive rapidity as in Fig. fTTH). 
But, for iBj ~ 1, the target remnant has low energy and 
rapidity, so that the range of applicability of the soft re- 
gion is very restricted on the target side, as shown in Fig. 
I17f ii) : The target region has a wide range of rapidity. 

E. Target-Collinear Region 

Now that we have characterized the smallest region, a 
treatment of the collinear regions follows naturally with 
the aid of the subtraction method described in Sect. I VII 
For the contribution of the target-collinear gluons to a 
graph, such as the single gluon emission graph of Fig.fTTl 
we now construct an approximator, Tt- To avoid double 
counting, the subtraction formalism requires the contri- 
bution of the target-collinear region to be obtained by 
applying Xt to the graph minus its soft approximation: 

c T r(«) = T T (r^ - T s r<-V). (75) 

At the top end of the 1% gluon, it attaches to a subgraph 



where the rapidity is much more negative, when the gluon 
is in the target-collinear region. All the same issues about 
leading polarizations apply as for the case that the gluon 
is soft, so at this end of the gluon the approximator is 
very similar to the soft approximator Tg. 

As we discussed earlier, there are two ways we could 
define the target-collinear regions. One is that the gluon 
has low transverse momentum (e.g., of order A) and its 
plus momentum is of order Q in the Breit frame. In the 
case of the gluon-exchange graphs in Fig. [TJ2] only graphs 
(b) and (c) are then important; graphs (d) and (e) are 
suppressed because they have extra off-shell propagators. 
There is one far off-shell line: the quark k + q between 
the gluon attachment to the upper subgraph and the hard 
vertex has virtuality of order Q 2 . We could correctly con- 
sider this line as part of the hard subgraph. (Note that 
although I2 is not routed through this line, we have im- 
posed a low-mass requirement on the hadron-quark part 
of the final state, immediately to its right. So the large 
plus momentum of I2 actually flows on the k + q line.) 

However, when the gluon is massless, its transverse 
momentum can be very small, and then it can be useful 
to define the target-collinear regions by the gluon rapid- 
ity being comparable to the target rapidity, e y ~ e VT . 
As shown in Fig. I17f i). which graphs contribute depends 
on exactly how small the transverse momentum is. In 
all cases, the same method of approximation applies for 
the coupling of the gluon to the upper subgraph J . The 
Ward identity argument for summing the contributions 
will work; it will extract the gluon from the upper sub- 
graph and convert to couple to a Wilson line indepen- 
dently of which of the graphs are involved. 

In view of these issues, it is not totally obvious where 
to apply the projection on the Dirac matrices. So we 
formulate the approximator in two stages. The first stage 
just involves writing a Ward-identity-compatiblc form for 
the coupling to J, just as in Eq. f57]) , to obtain 




TT r<,, = %/^/^ [y,, ■ + ,, h) y*(*, (,, p)] jpiSL- w«0. <*» 

Here we use the vector n s of nearly zero rapidity, rather than the vector nj of very negative rapidity that we used 
in the corresponding part of the soft approximation. This provides a cutoff at central rapidities, as appropriate for 
a collinear region. When l 2 has a large positive rapidity, both vectors agree in projecting out the plus component: 
h ■ n s ~ l 2 ■ nj ~ 1%. 

Again, as before, we apply a Ward identity, but now only on the jet side, to obtain 



J(k + q - l 2 )Y 



to ■ n. - it 



(77) 



We identify the square bracket factor in Eq. ([77)) as a contribution to the numerator of the PCF defined in Eq. (l4"5")) 
with the gluon coupling to the Wilson line. The remaining factor in the integrand is exactly the same as the jet factor 
in the ordinary handbag diagram, Eq. (1241) . aside from a different labeling of the loop momenta. 

The definition of the collinear approximator is completed by applying the parton-model approximator at the elec- 
tromagnetic vertex, as illustrated in Fig.[THJ We also make a shift of integration variable, replacing k by k — l 2 , so as 
to make the correspondence with the simple handbag diagram clear. 



T T r(«> = -L / 4i^r~ + *) k)\ 2 \l ^Mr^-v 



^ ' 2vr J (2Tr) 4 -q+q- v ^ 1 n [J (2tt 



?l$P(k + l 2 ,l 2 ,P) 



(78) 

Here J~ and \Hq\ are exactly the same jet factor and hard scattering coefficient as in the simplest handbag diagram 
Fig. HJa), and the factor in braces corresponds to the numerator in the definition of the PCF Eq. (|45[) with one gluon 
connecting to the Wilson line. 

Finally we need the double-counting-subtraction term in Eq. (|75[) . The new item needed is the result of first applying 
the soft approximator Ts and then the collinear approximator Xt- The collinear approximator simply modifies the 
upper vertex of l 2 by replacing the vector nj by the vector n s : 

T T T s j K (k + qJ 2 )=T T J(k + q,l 2 )-l 2 
= J(k + q,l 2 )-h 



h ■ nj 
nj ■ l 2 



h ■ nj — it h • n s — ie 
n K 

= J(k + q,h)-l 2l s — (79) 

t 2 • n s — ie 

Applying Ward identities now gives us a term just like that for the soft approximation Fig. 1151 except with the change 
from nj to n s . However, we now must identify the eikonal factor as part of the PDF. 
We therefore write the contribution of the target-collinear region as 



r 



CtT(R)= ^ J (£)4 -q+q- Wa(<l,k)\ 2 J-(k + q)F m) (k,P), (80) 
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where 



Ft 



(R,l 



(27T) 



\\ Jg;K P {h) — 



-Tr 



2 • n s - le 



-&(k,l 2 ,P) 



$ + {k + l 2 ,P)S( R >V(l 2 ,n T ,n s ) 



(81) 



We now recognize F(ri) (k, P) as a part of the PCF due to one-gluon exchange with the Wilson lines in its definition, 
Eq. (|45p . The one-gluon exchange term is found from expanding the Wilson line operators in powers of the coupling. 
Then the first term in ([51]) , as we have already noted, is from the numerator of Eq. (j4"5)) . The second term arises 
from the 0(g 2 ) term in the denominator. In coordinate space, this multiplies the lowest order term in the numerator. 
Fourier transformation gives the convolution product in the second term in Eq. (fST|) . as illustrated in Fig. [T51 
These results support the correctness of Eq. ([45]) as the definition of the PCF. 



F. Jet-collinear region 



The treatment of the remaining region, the jet-collinear region, is very similar — one follows the same steps as in 
the target collinear case. For completeness we state the result here. The result for the contribution of this region is 



^ C ^^r j( r(«)-T s r(«)) = i/^ 



<Vh ' 1 -\H (q,k)\ 2 J iRtl) (k + q)^+(k,P), (82) 



r r 

where the one-gluon-exchange contribution to the fragmentation PCF is 



J{R,i)( k + q) 



d% 

(27T) 



Tr 



\j K {k + q,h) 



-gJg; K p{h)n P s 

l 2 ■ n s - ie 



J-(k + q-l 2 )S {R ^{l 2 ,n T ,-n) 



(83) 



In obtaining this b y m odifying the derivation for the target-collinear region, we replaced n s by — n s , in accordance 
with the results of [35( and the discussion in sect. IVl 



G. Hard Gluons 



Although the issue of extracting the NLO hard scattering component is beyond the scope of this article, it is worth 
describing the general method we would follow. The NLO hard contribution from the single gluon emission diagrams 
will follow naturally in the subtraction formalism: 



T H {k,l 2 ) 



r< fl >(Ma 



Tf\k, 



r { j R) (k,i 2 ) 



(84) 



Here, IT s j(k,l) represents the result of applying the 
soft, hard, or jet approximator to the unapproximated 
graph. 



IX. VIRTUAL CORRECTIONS 

In this section we analyze in detail the diagrams with 
virtual corrections for the case of one gluon attachment. 

First, we note that all situations with gluons entirely 
confined to the jet or target subgraphs are already cov- 
ered by the basic parton- model argument in Sec. IIVI 
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FIG. 20: Virtual correction graph. 



These gluons are already included in what we mean by 
the jet and target subgraphs. 

Therefore we need to analyze graphs with a gluon ex- 
changed between the jet and target subgraphs, Fig. [201 
The general procedure follows that for real gluon emis- 
sion. The expression for the unapproximated graph is 



4tt J (2tt) 4 J (2tt) 4 
x Tr[ 7 ' J K (k + q, l 2 ) ^ l 2 , P)] G Kp {l 2 ) , (85) 

where <t p and J K are the bubbles with one extra gluon 
attached. These bubbles are the same as for real gluon 
emission, except that the gluon is on the left of the final- 
state cut in both bubbles. The subscript 'vc' means virtual 
correction. This formula is just like ([63)1 except that the 
cut dressed gluon propagator is replaced by an uncut 
propagator: 



Jg;K,p{h) — * G Kp (l 2 ) 



(86) 



A. Soft region 

We start with the smallest region, the soft one, which 
was appropriately defined in Sec. IVIIIl Following the 
procedure used for real-gluon emission, we find that the 
Grammer-Yennie method applied to (|85p gives 



e 2 P 



<Pk 



4tt J (2tt) 4 J (2tt) 4 
Tr[VrfV 3 J{k + q, l 2 ) ■ h VrfVr h, P) • h] 
< n^G Kp (l 2 )n^ 
(nj • h — ie)(nT ■ h + ie) 



(87) 



which exactly corresponds to Eq. ([68)1 for the real-gluon 
case. Unlike the real-gluon case, we now have to worry 
about the Glauber region, |^/^| <C l 2t , since the in- 
tegration over a virtual gluon momentum includes arbi- 
trarily small values of l^ and . To obtain the soft ap- 
proximation we needed the assertions l 2 • $ — l 2 & + an d 
J ■ l 2 — J~l 2 ■ The first assertion becomes invalid when 
I2 is too small, and the second assertion becomes invalid 
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FIG. 21: The deformation of the contour for the integral over 
It in Eq. (g7| 



of both parts of the soft approximation can be deduced 
from Sec. IVIII CI 



A 2 



(T) : 



h 



A 



+^2 



(P+) 
A 2 



► A. ( s.„ 



These estimates assume that the transverse momentum 
in the jet and target subgraphs are of order A. The first 
pair of conditions simply state that the rapidity of the 
gluon must be well inside the range between the jet and 
target rapidities, as is natural for the soft region. The 
second pair of conditions are that the longitudinal com- 
ponents of l 2 should not be too much smaller than the 
transverse momentum; from them can be deduced that 
we need 



A2 
Q 2 ' 



(90) 



when I2 is too small. The conditions for the validity 



A breakdown only of the conditions on the rapidity of 
l 2 simply takes us to one of the collinear regions, and 
that need not concern us here since we will treat the 
collinear regions separately. However a breakdown of the 
other conditions is problematic. We see from (|90)) that 
such a breakdown brings us to the Glauber region, i.e., 
to \l+l 2 \ «l 2 jt . 

As explained in [35|, |4J] , we can apply a contour de- 
formation to get out of the Glauber region. The contour 
deformation is to be applied to only, since the only 
significant dependence on l^ in the Glauber region is in 
the jet subgraph. All the relevant singularities are final- 
state singularities, and are therefore all in the upper half- 
plane; thus the same deformation works for all graphs. 
This is shown graphically in the complex plane shown 
in Fig. [21] where the crosses represent the final state 
poles. An attempt to apply a corresponding argument 
to the other longitudinal component l^ would fail, be- 
cause there can be both initial- and final-state singulari- 
ties for I2 in the target subgraph. For the application of 
a Ward identity, it is essential to have a single contour 
deformation applied to l 2 for every graph that is summed 
by the Ward identity. The choice of ie in the eikonal de- 
nominators, particularly nj ■ l 2 — ie, was determined by 
compatibility with the contour deformation of l^ away 
from final-state poles in the jet subgraph. Depending on 
the size of l^ , the contour deformation may take l 2 to the 
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conventional soft region or to the target-collinear region. 
In either case we have a situation for which we have an 
applicable technique. 

Now that we have deformed the contour integration out 
of the Glauber region, the soft approximation is valid over 
the whole of the soft region, as defined by the rapidity 
condition. Therefore we can apply Ward identities to 
the sum over graphs, just as with real-gluon emission, to 
obtain 



d 4 k 



p 2 P 

vc,soft 4n J ( 27r )4 

Tr [Vj'fVj J(k + q) VrfVr $(k, P)] x 
1% g 2 C F n*G Kp n P T 



(27r) 4 (nj • h — ie)(riT • h + ie) ' 



(91) 



where we have performed summation over colors and the 
strong coupling constant has been taken out of the ver- 
tices J K and $ p . The soft gluon has factorized from the 
rest of the graph. Effectively the soft gluon only sees 
the total color charge and direction of the target and jet 
lines, and has no sensitivity to the details. 
The expression in the last line of (f9Tj) 



= g 2 C F 



(2tt) 4 (nj • l 2 ~ie){n T ■ l 2 + ie)' 



(92) 



is the lowest order term of the vacuum expectation value 
of two Wilson lines, exactly corresponding to the relevant 
term in our definition of the soft factor in Eq. (|4"3"|) when 
the intermediate state between the left three Wilson lines 
and the right three is the vacuum state. 

The use of non- lightlike lines, in directions tit and nj 
cuts off the rapidity divergences that would occur if light- 
like lines were used [45(. We could follow an alternative 
procedure, suggested in j45| of using light-like Wilson 
lines, but with extra generalized renormalization factors 
in the definition of the soft factor to cancel the rapidity 
divergences. But we will not follow this idea here. 

We do remark that that Eq. (|52")1 does have a UV di- 
vergence. We define it to be removed by applying renor- 
malization, as usual; this is an ordinary UV divergence 
associated with the cusp joining the two segments of the 
Wilson line. 




FIG. 22: Graphical representation of the target-collinear con- 
tribution for one virtual collinear gluon, as in (|95|l . From this 
is to be subtracted a soft term. 



we apply the ie prescription to the eikonal denominator 
that is compatible with the contour deformation: 



Y J K (k + q,l 2 )r ® P {kM,P) G Kp 

* l v J(k + q, h) ■ h r , <G "" ■ $ P (fc, h,P) • (93) 
h ■ n s - ie 



This leads to a first form of the approximator: 
d 4 fc f d% 



T T1 T V 



e 2 P 

47T 



(2tt) 4 J (2tt) 4 



Tr^ J(k + q,l 2 )-l 2 r ® p {k,l 2 ,P) 



TT-sGnp 



2 • n s 



(94) 



Naturally, we must use the same eikonal denominator for 
both real and virtual gluon emission in order that all 
the contributions to the target PCF arise from the same 
Wilson line. 

The remaining steps follow exactly as for real emission: 
(i) Subtraction of a soft term as in Eq. (|75|) to compensate 
double counting with the smaller region, (ii) Application 
of a leading-power approximation to the hard scattering, 
(hi) Use of Ward identities. The result is: 



i 

2^ 



d 4 k 



1 



(2^) 4 -q+q- 



\H (q,k)\ J-(k + q)F (Vjl) (k,P), (95) 



B. Target-collinear region 

The treatment of the target-collinear region for l 2 
works exactly as in the real emission case. But we 
must apply the approximators and the subtraction to the 
graph with the contour deformed (to avoid the Glauber 
region), in order for the double-counting subtraction for 
the soft region to be correct. This requires that in the 
Grammer-Yennie approximation for the collinear region 



with the 1-gluon virtual gluon contribution to the parton 
correlation function defined as 



F (v ,i)(fc,P) = 



(2tt) 



$ p (k,h,P) 



-gC F n K s G Kp 



li ■ n s 



it 



<S>+(k,P)S( V ' 1 \y T -y s ) 



(96) 



This is exactly the sum of the contributions caused by 
one virtual gluon coupling to the Wilson line(s) in the 
target PCF. 
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C. Jet-collinear region 

The case in which the gluon is collinear to the outgo- 
ing jet can be worked out in complete analogy with the 
previous cases. The result is 

\H {q,k)\ 2 J(V,l)(k + q)$ + (k,P), (97) 

where 

J s(kP\- [ dH2 7«(k 1 P\ -9CFG Kp nP 
J(v,i){k,P) - J j^J (k,l 2 ,P) h , ns _ ie 

-J-(k,P)S^ 1 \y s -y 3 ). (98) 

To summarize, a graphical depiction of the resulting fac- 
torization for the virtual contribution is shown in Fig. 1221 

D. Hard vertex correction 

We have simplified our work by restricting to final 
states without particles or jets of extra high transverse 
momentum. Thus we did not need to treat real correc- 
tions where extra partons are emitted from the hard scat- 
tering. We treated collinear and soft gluons, but not hard 
gluons. 



For virtual corrections, the situation is different: loop 
momenta are not restricted by the external states, and 
can be arbitrarily large. However, in the graphs entailed 
by Fig. 1201 the only one in which a hard gluon gives 
a leading-power contribution is in the vertex graph of 
Fig. [23] This graph is, of course, to be considered as a 
particular subgraph for Fig. [20] 

So we now present the correction to the hard scatter- 
ing coefficient, associated with the vertex graph. To it is 
to be added the complex conjugate contribution from the 
same correction at the current vertex on the right of the 
final-state cut. Let us use V to denote the vertex graph. 
Its contributions where the gluon is soft or collinear have 
already been allowed for, so there remains only the con- 
tribution from the hard region. In accordance with our 
subtraction formalism, this contribution is 
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FIG. 23: One loop correction to current vertex. 



C H V = T h [1-T s - T t (1 - T S ) - Tj(l - T s )} V 
= T H [1 - T T - Tj - (1 - T T - T 3 )T S ] V 

_ -ig 2 [a 1 „ J 7 Q 7 ' (1 ~ r ) 7 M 7 ' - r ) la 



<fr- -V T < 



(2tt) 4 J r 2 +ie [ [(I - r ) 2 + ie] [(k - rf + ie] 

7 M 7 ■ (k — r) 7 • n s 7 • n s 7 • (/ — r) 7 M 



>'P T 

(—r ■ n s + ie) [(k — r) 2 + ie] [(I — r) 2 + ie] (— r ■ n s + ie) \ 
+ UV counterterm. (99) 

Our normal recipe for a hard scattering required us to insert the projection matrix Vt at each side, and to use the 
massless on-shell external momenta k and I that were defined earlier. We also used the fact that in Feynman gauge 
the three subtraction terms involving Tg exactly cancel. These terms — T# (1 — Tt — Tj)TsV are like those in Eq. (|99|) 
but with the factor in braces replaced by 

uj -u T : n s -u T , uj ■ n s 



(— uj ■ r + ie) (-itT • r — ie) (—n s ■ r + ie) {— mt • r — ie) (—uj ■ r + ie) (—n s ■ r + ie)' 



which is exactly zero. Since we take the massless limit 
in the hard scattering, we have replace the non-lightlikc 
vectors nx and nj by their lightlike counterparts ut and 

Uj. 



X. FULL FACTORIZATION 



We now have enough techniques to obtain a full factor- 
ization formula, valid at the leading power in Q including 
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all logarithmic corrections. However, we will restrict our 
derivation to a model theory with massive Abelian glu- 
ons. This avoids certain complications with Ward iden- 
tities in a non-Abclian theory and with actual IR di- 
vergences associated with the masslessness of the gluon. 
These complications we leave to later work. The model 
exhibits the issues of gluons coupling subgraphs with dif- 
ferent kinds of momenta and the need for appropriate 
Wilson lines in the definitions of the PCFs. Even so, the 
formulation of factorization, with the definitions of the 
PCFs already exhibited, is equally applicable to QCD. 

As we have done throughout in this paper, we also re- 
strict to final states with low transverse momenta in the 
Breit frame. Thus the leading regions do not involve ex- 
tra groups (or jets) of collinear partons emitted from the 
hard scattering. Since this is a restriction on the final 
state, and our approximation methods leave unaltered 
all final state momenta, this is a safe restriction appro- 
priate for exhibiting the simplest and some of the most 
important cases where PCFs are important. 

In Fig. [Ml we give a graphical overview of the proof: 

• In graph (a) we symbolize the most general leading 
region. It has a connected hard subgraph associ- 
ated with each of the external vertices for electro- 
magnetic current. It has a connected collinear sub- 
graph for the target and for the jet, each of which 
has, in addition to the standard quark connection, 
arbitrarily many gluons connecting it to the hard 
subgraphs. It has a soft subgraph connected by 
gluons to both collinear subgraphs. The soft sub- 
graph has arbitrarily many connected components 
(including zero as a possibility), and each compo- 
nent must couple to both collinear subgraphs. 

Let R denote any such region of a general graph T 
for DIS. 

• We then apply the operation CrT to obtain the 
contribution associated with this region. We can 
now understand graph (a) to denote CrT with a 
sum over R and T to give the complete leading- 
power contribution. Subtractions, as defined in Eq. 
(|51[) . ensure that the contributions from smaller re- 
gions are suppressed. That is, when the loop mo- 
menta are close to the defining surface of a smaller 
region R' y there is an actual power-law suppres- 
sion, and when the loop momenta are far from this 
surface, there is a subtraction to prevent double 
counting between CrT and Cr'T. 

• In accordance with our earlier discussions, we do 
not apply any restriction on the internal momenta 
of the graph; any suppression comes purely from 
overall momenta conservation, from the nature of 
the subtractions, and/or from any restrictions ex- 
plicitly imposed in the definition of the approxima- 
tors. 

• Then we apply Ward identities to the connections 
of the soft subgraph to the collinear subgraphs. 



One of the reasons we have been insisting on not 
applying restrictions to the internal momenta in a 
graph, is that this is necessary in order that Ward 
identities work exactly. The derivations of Ward 
identities involve shifts of loop momenta, so that 
any internal restriction on loop momenta is liable to 
violate the Ward identity: A shift of an integration 
variable near a boundary can move a momentum 
across the boundary. 

This contrasts with the situation in Soft-Collinear 
Effective Theory (SCET) 0, and other methods 
related to a Wilsonian renormalization group. In 
SCET, fields are defined with explicit restrictions 
to particular localities in momentum space, and it 
is not at all obvious how this is to be actually im- 
plemented, at least not with the exact preservation 
of Ward identities. 

In the present context of deriving a proper factor- 
ization formula, with the anticipated power sup- 
pressed corrections, a non-exact Ward identity 
presents a serious problem. In a particular re- 
gion, one may be tempted to apply an approxi- 
mate Ward identity to the sum over graphs. Close 
to the particular region under investigation, the 
terms that violate the exact Ward identity may be 
power suppressed. However, within the subtrac- 
tion formalism (which allows for a proper derivation 
of a factorization formula) we encounter equations 
like (|5ip . where the approximated graphs are used 
far outside their corresponding regions, where the 
leftover terms in the Ward identities are not small. 

• In an Abelian theory, application of a Ward identity 
is entirely straightforward, and after the sum over 
all relevant graphs the gluons from the soft sub- 
graph are attached to vertices for Wilson lines, one 
Wilson line for each quark at the hard scattering. 
We must define the approximator for the region R 
so that the approximated hard subgraphs are ex- 
actly independent of the soft momenta. Then the 
Wilson lines meet at a point and we have separated 
out a soft factor — Fig. [24T bV The soft factor is a 
PCF defined by Eq. (@3J). 

There are some subtleties in defining the approxi- 
mators so that the hard subgraphs become exactly 
independent of the soft momenta; we will return to 
this issue. 

• The Ward identities apply most directly to the 
approximated graph TrT; then the collinear sub- 
graphs are just ordinary Green functions. How- 
ever, the double counting subtractions for smaller 
regions — Eq. (|oTj) — change this situation, and 
we must also ensure, as we will do later, that the 
double counting subtractions are compatible with 
the Ward identities. 

• Finally, we apply Ward identities to the gluons con- 
necting the collinear subgraphs to each of the two 
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(c) 

FIG. 24: Conversion to factorized form: (a) All leading reduced graphs with generalized handbag structure, (b) After application 
of Ward identities to connection of gluons from soft subgraphs to collinear subgraphs, (c) After application of Ward identities 
to connection of collinear subgraphs to hard subgraphs. 



hard subgraphs. This gives Fig. [24jc), giving the 
jet and target PCF factors. We will have to inves- 
tigate more carefully the subtractions for smaller 
regions, and we need to show that these produce 
the denominators in the definitions (|45p and (|47|) . 



Loop graphs have subtractions for non-hard re- 
gions, so that we have a properly defined hard co- 
efficient. 

A. Factorization formula 



• The hard subgraphs are now restricted to having 
the same external lines as in the parton model. 



The resulting factorization formula is 
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(2tt) 4 (2tt) 4 (2tt) 4 

x I#(<3,m)| 2 S(ks,yT,VJ,^) F mo<i (k T ,yp,y T ,y s , fi) Jmod(kj,yj,y s , v), (101) 

where S 1 , F mo a and J mo d are the Fourier transforms [Eq. (|48[) ] into momentum space of the soft, target, and jet PCFs 
defined in Eqs. (|43|) . (j45|) . and (T47|) . The variables fcx, fcj, and fcs are the momenta of the final states in the target, 
jet, and soft PCFs. 

This formula can be simplified. The target (and jet) PCFs have two rapidity arguments for Wilson lines, which is 
in contrast to the situation in the CS formalism. This can be changed by moving the denominators from the target 
and jet PCFs to the soft PCF, so that the soft PCF is redefined to 

a < \ S{w,y T ,y.hV) nno , 
Si{w,y T ,yj,y s ,fj,) = -~- — ~- -. (102) 

&(w,yT,y s ,ti>) b{w,y s ,yj,(i) 

In this soft factor, the denominators remove the contributions from large positive and negative rapidities. We expect 
to be able to take the limits that the vectors nx and nj become light-like: 

S 2 {w,y s ,n) = §i(w,+oo, -oa,y s ,Li). (103) 

This corresponds to the soft factor defined by Collins and Soper [2| . 
Then the factorization formula becomes 

x \H(Q,fi)\ 2 S 2 (k s ,y s ,n) F(k T ,y p ,y s ,n) J(&j,y s ,/z), (104) 



where we use the target PCF F defined by Eq. (j4"4"|) in- 
stead of -Fined (and similarly for the jet PCF) defined by 
Eq. (f45|) . Note that there is a difference from the CS 
case, where the soft factor is independent of y s . The CS 
soft factor is defined appropriately for kt factorization, 
so that it is the integral over kg and kg of the soft factor 
defined here, to give a function of transverse momentum 
alone. This quantity is invariant under boosts in the z 
direction. In contrast, our soft factor depends on longi- 
tudinal momenta as well. 



B. Momentum routing in approximators 

It is important for factorization that the hard scatter- 
ing coefficient should depend only on Q 2 . It should not 
depend on the loop momenta in the soft factor. There- 
fore in constructing the approximator for a region, we 
need to arrange that the hard subgraphs are approxi- 
mated as independent of the soft momenta. In addition, 
so that the hard scattering coefficients can be treated as 
on-shell matrix elements (modified by subtractions), we 
will approximate their external parton momenta by on- 
shell massless momenta. So that we may be sure that this 
can be done in general, for a graph with arbitrarily many 
soft and collinear gluons, we must be certain that the ap- 
proximators that we use do not introduce any anomalous 
dependence on soft gluon momenta, and that all approx- 



imations are exactly consistent with the application of 
Ward identity relations. 

The difficulty in constructing a general prescription 
obeying these requirements can be seen from the sim- 
ple case of Fig. [TTJ There, the trouble comes from the 
correct choice for "routing" momenta. By a choice of 
routing we mean the choice of which momenta are to be 
treated as independent variables. Different choices cor- 
respond to different explicit appearances of momentum 
variables around different loops in the graph. Of course, 
the choice of momentum labeling is arbitrary and has 
no effect on an unapproximated Feynman graph. How- 
ever, the approximators are defined with respect to a 
certain set of variables. Hence, the same instructions for 
approximating a graph will lead to different results for 
different routings of momentum. In other words, we can 
say that the approximators are not completely defined 
until a choice of momentum routing is made. 

For example, in Fig.[TT]the soft momentum l 2 is routed 
through the left-hand vertex. Thus, it appears natural to 
define the approximated momenta for the hard vertex by 
replacing l 2 by zero and by then keeping only the minus 
and plus components of the quark momentum. That is, 
in the hard vertex we perform the replacements q + k — 
h i-> (0, q~ + k~,0 t ) and k - l 2 i-> (k + ,0,0 t ) on the 
external momenta of the left hand current vertex. At 
the right-hand vertex there is no soft momentum, so we 
simply make the replacements q + k i— > (0, q~ + Ot) 
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and k i-> (fc+,0,O t ). 

Observe that the value of minus momentum on the 
quark line going to the jet was changed on the left-hand 
side, but not on the right-hand side, and similarly for the 
quark from the target subgraph. We can reverse this sit- 
uation by simply changing the routing of l 2 to go through 
the right-hand current vertex, for example by changing 
variables to k\ = k — l 2 . In that case the momenta at the 
left-hand vertex are q + k\ and k\ , while the momenta at 
the right-hand vertex are q + k\ + l 2 and k\ + l 2 - Thus 
the definition of the approximation varies depending on 
the routing of the loop momenta, and the two routings 
we have shown are equally legitimate. 

In the true soft region, the components of l 2 are small 
compared with Q and so the difference between the def- 
initions is a small, power-suppressed effect. But we per- 
form an integral over all kinematically accessible mo- 
menta, so the difference amounts to a genuine inconsis- 
tency. 

Nevertheless, the inconsistency did not affect our treat- 
ment of Fig. [TTJ because its hard subgraphs are single 
vertices and hence independent of momentum. But if 
we consider a more general situation, as in Fig. 1241 with 
non-trivial hard subgraphs, the momentum dependence 
of the hard subgraphs creates an inconsistency between 
(at least) two possible definitions of approximation for 
the hard subgraph. 

A further problem appears when we observe that the 
approximated minus momentum on the q + k — 1% and 
q + k lines is not q~ , and that the approximated plus 
momentum on the k — l 2 and k lines is not — q + . This 
creates the situation that, beyond lowest order in the 
hard scattering, the hard scattering coefficient does not 
depend just on Q. Again, in a situation of really collincar 
momenta this is a small effect, but we integrate the PCFs 
out beyond this region. 

These issues are closely related to the kinematic incon- 
sistencies in parton showering algorithms that were found 
by Bengtsson and Sjostrand [27j, and that led [3j, U, |5j, |6J 
to the proposal to use PCFs rather than regular parton 
densities. 

The essential difficulty in defining the approximations 
is that we have to know what are the independent vari- 
ables in the various subgraphs. However, the momenta 
of different lines are constrained by momentum conser- 
vation and are not all independent. The dangers are 
made even worse by our pervasive use of Ward identities. 
Graphical proofs of Ward identities require shifts of in- 
tegration variables. Consistency of Ward identities with 
the approximations requires that the approximations be 
invariant under certain reroutings of loop momenta. As 
we will now see, a consistent prescription for labeling mo- 
menta involves treating all of the outgoing parton mo- 
menta as independent variables, but treating the photon 
momentum as a dependent variable. 

Our solution is two fold: In the first step we route 
all the momenta from the soft subgraph out through the 
hard vertex; i.e., the photon momentum is not treated 




FIG. 25: Definition of momentum variables for a given region. 
The subgraphs are coded by color as: red = soft, blue = jet- 
collinear, green = target-collinear, black = hard. 

as an independent variable, and is not fixed to q. See 
Fig. US] for a graphical depiction of the basic setup. This 
choice of independent variables is sufficient to treat both 
hard subgraphs the same way, and gives a routing con- 
sistent with all the Ward identities we apply. The second 
step, after approximating the external lines of the hard 
subgraph as before, is to define a new set of variables 
by rescaling the approximated minus and plus momenta 
so that outgoing jet momenta sum to (0, q~ , Ot) and the 
incoming jet momenta sum to (— </ + ,0, t ). We end up 
with a treatment of momenta that is similar to the pre- 
scription of Collins and Zu [f|, but is more complicated 
because of the need to treat soft subgraphs. 

Now let us go through the steps described above ex- 
plicitly. We let l S2 denote the collection of momenta en- 
tering the jet-collinear subgraph from the soft subgraph. 
Then we write the collection of momenta leaving the hard 
subgraph to the jet-collinear subgraph as 

Ij - Mjl_ S2 . (105) 

Here Mj is an incidence matrix. Thus for a particular 
outgoing jet line, labeled by an index, j, we have 

hj-^Mjjklsik, (106) 
k 

where the sum is over the soft lines outgoing from 
the subgraph J . Similarly, we write the collection of 
momenta entering the hard subgraph from the target- 
collinear subgraph as 

1_ T + M T 1_ S1 . (107) 
Thus for a particular incoming target line, we have 

hj+^M Tjk l S ik- (108) 
k 

where the sum is over the soft lines outgoing from 
the subgraph Momentum conservation on the final 
state will impose some constraints between the momenta 
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within Zj, within l T , and within (IsiiLsi)' but this will 
not affect our argument. Within the hard subgraph, we 
define the approximator so that it makes the following 
replacements: 

h-Mjl_s 2 ^Xj(0,lj,Ot), (109) 
Z T + M T 1_ S1 i ► AtUt » °> °t)- (HO) 

That is, we project down to the minus and plus compo- 
nents of the collincar momenta only. After this projec- 
tion of momentum components, the next step is to apply 
a rescaling so that the momenta sum to q in the hard 
vertex. More specifically, (I109j) and (|110p mean that for 
each of the outgoing and incoming lines, we project onto 
a new set of variables as follows: 

l 3j - E M Jjk l S 2k ' * hj = Aj(0, IJ^Ot), (HI) 

fe 

hj + E M Tjk Isik >-> hj = AtGtj- 0. Ot). (112) 

fe 

As before, a "hat" on a momentum variable indicates 
that it is an approximated momentum variable. The scal- 
ings are determined from the condition that the incoming 
target collinear momenta add up to (— q + , 0, t ) while the 
outgoing jet collinear momenta add up to (0, q~ , 0*), 

E^ = E A -' (°>^> *) = (°> «">«*) . (us) 

E h = At (*Tj> °> °t) = 0, t ) . (114) 

j j 

Hence we get the following definitions for the scalings: 
A.i = ^— , (H5) 

At = (116) 
^j hj 

We also define the approximator so that the fermion lines 
are equipped with projection matrices, Vt in the ampli- 
tude and Vi in the complex conjugate. At the gluons we 
apply an appropriate Grammer-Yennie-style approxima- 
tion, as in Eq. ([76]) . 

To summarize, we have generalized (and modified) the 
replacement scheme of Sect. [IV] and Ref. @ in such a way 
that we now consistently deal with the presence of soft 
gluons. 

There is one non-obvious step for the Grammer-Yennie 
approximation that arises from the replacement of exact 
momenta with approximated momenta in the hard sub- 
graph. For a gluon coupling to the hard subgraph from 
the target-collinear subgraph, we make the replacement 

n K 

H K (l Tj , . . .) " s — hj • Hi l Tl . . . .). (117) 

lTj ■ n s — ie 

Here hj denotes the momentum of the line flowing into 
the hard subgraph on the gluon line. In the hard part we 



replace this momentum by its approximation defined in 
Eq. (|110p . So that the relevant Ward identity is exactly 
valid, we have performed the same replacement in the 
factor of momentum contracted with the hard subgraph. 
In contrast the denominator is left unaltered. This en- 
ables the n s vector to fulfill its purpose of cutting off 
the integral over the rapidity of hj ■ This unaltered de- 
nominator also ensures that we get exactly the expected 
Wilson line operator for the target PCF. 

Similar definitions apply to the gluonic connections 
from the other collinear subgraph. To summarize, the 
sequence of replacements listed in this subsection defines 
an approximator which (a) leaves the hard scattering sub- 
graph independent of soft momenta, and (b) allows for 
the exact application of Ward identities. 

C. Ward identities for soft into collinear subgraphs 

We first examine the Ward identity argument for the 
connection of soft gluons from subgraph B to the jet sub- 
graph J in Fig. [24Ta) . For each soft gluon the Grammer- 
Ycnnic approximation is applied, by a replacement of the 
form 

J*(ls2j, ...)"-» 1 — ls2j ■ J(l S 2j, ■■■), (H8) 

l>S2j ■ n>3 — «e 

where ls2j is the soft-gluon momentum oriented to enter 
subgraph J . Now the sum over regions and graphs for 
the whole process can be written as independent sums 
over the different subgraphs, subject to consistency on 
the kinds of lines joining them. So we now sum over sub- 
graphs J with the other subgraphs fixed, and also apply 
the same argument for the connection of the soft sub- 
graph to the target-collinear subgraph. In the absence of 
the double-counting subtractions, a standard Ward iden- 
tity applied to every gluon connecting the soft subgraph 
to the collinear subgraphs would result in Fig. 123T b). 

However, to each graph is applied a series of subtrac- 
tions from smaller regions, as in Eq. (|5ip . so we must ex- 
amine their compatibility with the Ward identities. So let 
R denote a particular region of the form of Fig.H?4Ta), for 
which we wish to construct its contribution as in Eq. (I5ip . 
The direct Ward identity argument applies to the term 
TrT. Subtractions concern all possible strictly smaller 
regions R' < R. Each R' assigns a momentum cate- 
gory (hard, collincar of some type, soft) to each line, and 
implicit in the subtractions in C^r are therefore approx- 
imations appropriate to regions smaller than R. 

The following discussion entails considering the rela- 
tion between the two different regions R and R' for a 
single graph T and also treating a sum over the possibil- 
ities for r, R, and R' , as in 

E^ r = E r *( r -E^' r ) 

r,i? r,R r'<r 

= Y( TrT - E t R t R'^ + ■■•)■ (H9) 

V,R R'<R 
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Each region can be specified in terms of its J, <I>, 7i and 
B subgraphs, so we will use a notation J{R), B{R) etc 
for the jet and soft subgraphs, etc, associated with the 
region R. The region R' has its corresponding J'(R'), 
B(R') subgraphs etc. 

Since R' is smaller than R, it imposes tighter con- 
straints on the momentum categories of the lines. Hence, 
as regards the soft lines, the set of lines categorized (or 
labeled) as soft for R' is at least as big as the set of lines 
labeled as soft for R. That is, the soft subgraph of R' is at 
least as big as the soft subgraph of R, i.e., B(R') 2 B(R), 
as shown in Fig. 1261 In forming the term TrCr/T in Eq. 
(|119p . first the approximator Tri is applied and then Tr. 
(There can be further subtractions inside CriT, but that 
need not concern us here.) 

In the representation in Fig. the approximator Tr 
applies some operation at the boundary of B(R), while 
Tri applies some operation at the boundary of B(R'). 
(There are also operations applied at the boundaries be- 
tween the collinear and hard subgraphs, but that will be 
our topic later.) Since B(R') 3 B(R), the operation Tri 
has no effect on the soft subgraph B(R) for the origi- 
nal region R. But some lines of jet subgraph ^T(R) may 
be soft according to i?', so the approximator Tri has an 



effect inside J(R), when we form the term TrTriT. 

An example of the situation we wish to discuss is shown 
in Fig. [27] For each graph T\ and T2 in the left-hand 
column, suppose we have constructed for each graph the 
term Cr' Tj for the region shown in the right-hand col- 
umn where the two gluons are both soft. Next we wish to 
construct the the term Cr. Tj for the bigger region shown 
in the middle column, where only one gluon is soft and 
the other is jet-collinear. After application of the soft ap- 
proximation, a Ward identity, discussed below, will let us 
combine the two ways of attaching the red gluon to the 
upper quark line. There are also subtractions to prevent 
double-counting. The nature of the soft approximation 
lets us combine the subtraction terms related to the right- 
hand column in the same way as in the middle column, 
thus ensuring that the Ward identities are compatible 
with subtractions. 

Suitably viewed, this pair of graphs actually leads us 
to the general case. First, let us examine the standard 
graphical derivation of the Ward identity. The basic step 
is the following identity applied to the connection of a 
soft gluon to one quark line, after the replacement (|118|) 
is applied: 



-/- 



I ■ nj — ie 



-I- 



(120a) 



I ■ nj — ie f — m 
which can be written graphically as 

k-l 



(120b) 

On the left-hand side, the double line and the arrow at the end of the gluon denote, respectively, the first factor and 
the ls2j- factor in (|118j) . The double- line notation is to exhibit that its factor corresponds to Feynman rules for a 
Wilson line. The big dots at the vertices on the right hand side of (|120b ) are used to emphasize the special vertex 
where the Wilson-line component attaches to the rest of the graph. 

We now examine what happens when we take two different graphs for J^(R) that are related by having the gluon 
I attached on opposite sides of some other vertex, with another gluon l^. Embedded inside the subgraph J(R) 1 we 
have the following situation: 





(121a) 





(121b) 
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(a) (b) (c) 

FIG. 26: (a) Representation of subgraphs for region R. (b) Representation for smaller region R' . (c) Soft subgraph of R' 
overlaid on subgraphs for R. The color coding is the same as in Fig. 




(d) (e) (f) 

FIG. 27: (a-c) A graph for DIS, and two of its regions, (d-f) Another graph, and two of its regions. The diagrams in the 
left-hand column, all in black, are particular Feynman graphs for the process, i.e., a candidate for V. The colored diagrams in 
the middle column represent a particular region (a candidate for R) for each of these graphs where one of the gluons is soft and 
one is jet-collinear. The diagrams in the right-hand column represent a smaller region, a candidate for R' where both gluons 
are soft. The color coding is the same as in Fig. 1251 



An example of this situation is provided by Fig. 1271 with 
the gluon I being the red gluon in the middle column. 

After we sum over the two graphs, on the left-hand- 
side, there is a cancellation between the two terms where 
the special vertex is at the second gluon, i.e., of the first 
term on the right of Eq. (|121ap with the second term on 
the right of Eq. p21b|) : 




(122) 

Then follows a cascade of such cancellations when we sum 
over all ways of attaching gluon I to the unsubtracted jet- 
collinear subgraph, i.e., when we sum over the relevant 
possibilities for J{R), e.g., over the the middle column 



in Fig. H7] 

There remain only terms at the end of fermion lines, 
and those at on-shell ends give zero. Summing over all 
graphs for J{R) gives 




(123) 

where the dots indicate arbitrarily many gluon connec- 
tions, both between J(R) and the hard subgraph, and 
between J{R) and the soft subgraph B(R). Applying 
the same argument to all the gluons from B(R) to J(R), 
and then summing over all cases for B(R), gives accumu- 
lated Wilson line components at the left which exactly 
correspond to the Feynman rules for the nj Wilson line 
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FIG. 28: Lowest-order connection to hard subgraph of target- 
collinear gluon. The hooked lines on the outermost fermion 
lines indicate where fermion momenta are replaced by mass- 
less on-shell approximations. 



in the definition of the soft PCF. 

But for each term in the sum over J^(R), there is a 
sum over subregions B! < R. For the cancellation (|122[) 
to work in the presence of subtractions, we make a 1-to-l 
correspondence between the regions R and R' appropri- 
ate to the two graphs in (|121|) , considered as embedded 
in graphs for J(R). 

For example, in Eq. (|121a[) we can consider two regions: 
The larger region we call R a for the case that the other 
gluon l\ is inside J{R a ), i.e., is collinear to the outgoing 
jet — we refer to R a as the outer region. There can be 
a subregion R' a for which this gluon is labeled soft, but 
such that the fermions are still labeled collinear. The 
subtraction for this region involves applying a Grammer- 
Ycnnic approximation at the end of gluon l\. But when 
we exchange the two gluons, so that we are instead con- 
sidering Eq. (|121b[) , the same assignment of momentum 
types still corresponds to a leading region. Thus, we have 
a outer region Rf, and a subregion R' b for Eq. (fi2Tb| anal- 
ogous to R a and R' a for Eq. (|12La|) . In both of the cases 
shown in Eq. (|121[) we have subregions R'j (embedded in 
some graph for J^(Rj)), where the fermions are collinear 
and l\ is soft, and exactly the same approximation is ap- 
plied at the end of the gluon li in the second line. Note 
that the outer regions, R a and Rb, are not literally the 
same because they apply to a different overall graphs for 
J(Rj); the same applies to the subregions, of course. 

Of course, we have to sum over all possibilities for 
leading-power subregions. There is only a very limited set 
of cases, and the same argument applies to all the other 
possibilities. Hence the cancellation (| 1 22[1 also applies 
in the presence of subtractions. This means that, the 
Ward identities apply both to the unsubtracted graphs 
and the subtracted graphs, given our definitions of the 
approximators. 



D. Ward identities for collinear into hard 
subgraphs 

The situation is somewhat more complicated with the 
Ward identities for collinear gluons connecting to a hard 
subgraph. 

First let us examine a lowest-order connection of a 
target-collinear gluon to the hard subgraph, Fig. [551 The 
vector in the approximation is now n s instead of nj. Of 




(a) 



FIG. 29: (a) This is not an allowed connection of a target- 
collinear gluon to the hard subgraph, since the gluon and 
the attached quark are both target-collinear: the intermedi- 
ate quark is therefore collinear, not hard, (b) The corrected 
treatment of the graph: The hard subgraph is just the vertex 
for the electromagnetic current. 




FIG. 30: (a) and (b) The lowest-order possibilities for two 
target-collinear gluons attaching to the hard subgraph, (c) 
Subregion with h collinear and I2 soft is allowed only for graph 
(a) and not graph (b), but in the subtraction the intermediate 
quark is on-shell. 



the two terms in the elementary Ward identity (|120D . 
only the first survives. The second term is zero because 
in the hard subgraph the quark fcj is replaced by a mass- 
less one with an on-shell momentum, and the associated 
Dirac-matrix projector Vt is equivalent to using an on- 
shell Dirac wave function. Although the hard subgraph is 
like an on-shell matrix element, one graph is missing, Fig. 
59Ta). which has the target-collinear gluon connecting to 
the target-collinear quark. Thus the sum over graphs — 
really just one graph here — gives the appropriate Wil- 
son line factor for the target PCF. This procedure readily 
generalizes to all the unsubtracted graphs. 

The only subregion and hence the only subtraction for 
Fig-EFJis where the gluon is soft. As can be seen from Eq. 
(f79j) . this has no effect on the approximator as applied at 
the hard subgraph (the hard vertex), so the subtraction 
leaves the Ward identity unaltered. 

A complication concerning subtractions does arise be- 
cause there is no longer always a 1-to-l correspondence 
between the subregions R' used in the subtractions for 
different terms in the sum (over R and T) to which we 
wish to apply a Ward identity argument, of the kind we 
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FIG. 31: (a) The application of the Ward identity when both gluons are collinear. (b) The application of the Ward identity 
when one of the gluons is soft. 



applied to soft gluons in Eq. Q121ap . To illustrate the 
situation explicitly, we analyze the case of two gluons, 
labeled l\ and l 2 attaching to the jet quark line — the 
two graphs in Fig. [30] For each of these graphs, we will 
consider two regions. One is the region where both glu- 
ons are target collinear. We will call this region Rcc (the 
outer region). The second, smaller region is where l\ re- 
mains target collinear, but I2 becomes soft. We will call 
this region Res, and we will need to take this region into 
account in setting up subtractions. To connect with the 
previous discussion, we note that Rcc corresponds to the 
outer region, i?, and Res corresponds to the smaller re- 
gion, R'. (Of course, there are other subregions we could 
consider. We consider this collection of regions for the 
purpose of illustration.) 

Looking first at the outer region, Rcc, we can imme- 
diately sum over the two possible graphs (a) and (b), 
giving the appropriate two-gluon coupling to the Wil- 
son line for the target PCF. This is shown graphically in 
Fig. l3"lTa). Note that in the hard subgraph, the approxi- 
mator replaces the collinear momenta by their large com- 
ponents. Thus fej — 1\ — 1% is replaced by (— — 1%, kj, t ) 
(with some possible scaling as well) . Next we consider the 
smaller region, Res, which will be relevant for subtrac- 
tions. We will refer to the contribution from a particular 
graph to Res as the subregion for that graph. For the 
subtractions obtained from subregions for graph (a) the 
one possibility we consider here, Res, is that gluon l\ is 
collinear and gluon I2 is soft, and this generates a sub- 
traction for graph (a). But for graph (b) this region is 
not leading, and there is no subtraction. Thus the sub- 
regions do not correspond between graphs. (A similar 
situation occurs with the graphs and momentum assign- 
ments reversed.) However, as far as the exact application 
of the Ward identity is concerned, the contribution from 
graph (b) is not needed in the subtraction terms. This is 
because in the subtraction term for graph (a), depicted in 
graph (c), the approximator Tr cs for the subregion sets 
the intermediate quark line, that propagates between the 
two gluon attachments, on-shell, i.e., fcj — I2 is replaced by 
(0, kj,0 t ). (Note the extra "hook" that appears on the 



intermediate line connecting the two gluon attachments 
in graph (c).) In order for the Ward identity to work, 
there is, therefore, no need to also consider graph (b) in 
the subtractions. In other words, after the application of 
Tr cs , graph (c) is already in the desired form. The ap- 
plication of the Ward identity in the smaller region gives 
the graphical equation in Fig. l3"lT b). 

For the subregion subtraction, we also need to apply 
Ward identities to the gluon I2 that is labeled soft in the 
subregion. Without the approximator Tr cs , we would 
need a cancellation with something related to the crossed 
graph (b). But since the approximator discussed in the 
previous section makes the inner subgraph exactly inde- 
pendent of I2, this does not matter. It should be recalled 
that we specifically chose the definitions of our soft ap- 
proximators so that these steps would work, i.e., so that 
Figs. 1317a) and 1317b) would both be exact. 

A fully detailed and explicit treatment is left to a future 
work. 



XI. CONCLUSIONS, OUTLOOK, AND FUTURE 
WORK 

In this paper, we have set up a factorization framework 
for deep inelastic scattering in QCD in which the exact 
kinematics of the partons is conserved. We have shown 
that this seemingly simple requirement leads to a non- 
trivial and significant conceptual shift even at the lowest, 
parton model level. The requirement of exact kinematics 
means that one has to abandon standard integrated par- 
ton densities and fragmentation functions, and instead 
use the parton correlation functions. 

The exact treatment of kinematics imposes particu- 
larly stringent requirements on the methods by which 
factorization is formulated and derived in the pres- 
ence of gluon exchanges between the different subgraphs 
(collinear and hard) with different kinds of parton mo- 
mentum. We have shown in detail how this works for 
one gluon exchange. Then we extended the results quite 
generally, to obtain a factorization property with defi- 
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nite gauge invariant operator definitions for the PCFs 
as well as for the soft factor. These definitions posses 
additional parameters (related to rapidity) which are in- 
troduced via non-light-like Wilson lines, comparable to 
those in the Collins-Soper-Sterman formalism for TMD 
distributions. 

So far, the factorization formula we have derived in- 
volves a rather trivial hard scattering coefficient. How- 
ever, having precisely formulated the factorization for the 
zeroth order hard scattering, the structure of higher order 
hard scattering coefficients can be readily determined us- 
ing the subtraction formalism. A calculation of this type 
is in progress. 

We expect that a reasonably straightforward general- 
ization of the methods of Ref. 0] will be suitable. Thus 
the hard scattering will be obtained from on-shell par- 
tonic cross sections with subtractions that compensate 
double counting with regions associated with lower-order 
hard scattering. The subtractions remove singular contri- 
butions where some of the partons in the hard scattering 
become collinear or soft. Because of our use of exact par- 
ton kinematics, the subtractions will be applied point-by- 
point in parton momentum, and will result in hard scat- 
tering coefficients that are ordinary integrable functions 
of external parton momenta. Thus they will correctly 
represent the corrections to parton probabilities differen- 
tial in parton kinematics. In contrast, standard methods 
of collinear factorization result in hard-scattering coeffi- 
cients that contain non-trivial generalized functions, e.g., 
the well-known plus distributions. Such distributions are 
very singular and cannot represent the detailed differen- 
tial distribution of parton kinematics. They only give 
correct results for cross sections that involve a broad av- 
erage over parton kinematics. 

There is still much work to do. First, the evolution 
equations for the PCFs need to be derived, presumably 
a natural generalization of the the CSS equations. Then 
we need to obtain methods for higher-order corrections to 
the hard scattering matrix elements. We need to extend 
the derivations to a non-Abelian gauge theory. In addi- 
tion, we need to determine if and how this formalism can 



be recast in terms of PCFs defined with light-like Wilson 
lines, but with factors that cancel light-cone divergences, 
as in doll. 
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APPENDIX A: PROJECTION MATRICES 

The projection matrices are defined as follows: 

Vt = , Vi = i 7 + 7" , (Al) 

where 

7- = ^(7°-7 3 ), 7 + = -^(7° + 7 3 )- (A2) 

The projectors satisfy the following relations 

V T + Vi = 1 , (A3) 

Pj 7 ~=7 + ^J=G, (A4) 

T T 7 + = 7~ V T = , (A5) 

Vj = 7°7 5 j t 7° = V T , (A6) 

V Tl -Vj = ~f-, (A7) 

Vjj + Vt = 7 + (A8) 

V\ = V T , (A9) 

V] = Vi . (A10) 
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APPENDIX B: THE ELEMENTARY WARD IDENTITY 



We have made frequent use of Ward identities to disentangle soft and collinear gluons from the soft and collinear 
bubbles such as those in Fig. [He). In this appendix we review the derivation of the elementary Ward identity for an 
Abelian gauge theory. We treat the case where a soft gluon attaches to the outgoing jet bubble - the upper bubble 
in Fig. [TTJ Analogous results hold for the other situations we have considered that require Ward identities (such as 
when the gluon attaches at the target bubble) and follow from similar arguments. 

Consider the contribution to the outgoing jet bubble represented by the following sum of graphs: 
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Here the outgoing quark attaches to two additional gluons as it enters the final state on the left side of the final state 
cut. (The other ends of these extra gluons may attach anywhere else in the jet bubble). In this example, we assume 
that the final state quark is on shell. The soft gluon insertion is represented by the attachment with a double line 
and arrow, indicating that we have made the approximations discussed in the main text that lead to Eq. (|68p . In 
particular, the soft momentum, I2 has been contracted with the soft gluon vertex, and there is a division by l 2 ■ nj. 
The sum of graphs in (|B1|1 contributes the following factor to the full graph: 



-u{h)l Pl [ 1 j7 )/2 (7 r~Ti )Tp2 



h-nj l \k-fti-mJ \k-k-ftx-mj \/i - k ~ #1 _ #2 _ 



+ rV s M^ (1 — I ) ^ (1 — t~~h. ) /s f 7 — / — I — 1 ) • (B2) 

Let us call the first term A, the second term B and the third term C . Now we notice that we can eliminate quark 
propagators and utilize the Dirac equation by substituting the following trivial identities for k : 

k = — (k ~ k ~ m ) + (k ~ m ) m term A 

= — (k — k ~ fti ~ m ) + (k ~ fti ~ m ) m term B 

= — (k ~ k ~ fti ~ $2 ~ m ) + (k ~ fti ~ $2 ~ m )- m term C 

These are each of the form of a difference of the denominators for the two quark lines next to the vertex for the I2 
gluon, so that 

A = 7^ — "(M7pi ( 7 r~~Ti ) ^pz ( 7 5 1 1 ) . (B3) 

B = ~r^hr u ( ll ^pi ( 7 — t~h, h»i 1 

h ■ nj \/i - h - fx - m J \/i — h - fii - f 2 - m 
+-J—u{h) lpi (- 1 ) lp2 (- 1 ) , (B4) 

C = —, — u(hh Pl (- )7 P2 



h-nj 1 \l 1 -$ 1 -mJ 2 \k - k ~ ^1 - #2 _ '' 
+ -J—u(h) 7pi (- 1 ) lp2 (- ) . (B5) 

In the sum, all the intermediate terms cancel, to leave only the last term, 

A + B + C k , nj u ^ ll ^P 1 _ ^ _ m ) 7 P 2 _ ^ _ ^ _ m J ■ ( B6 ) 

The soft gluon I2 has been factored out of the rest of the graph leaving only an over-all factor corresponding to the 
eikonal line propagator and eikonal vertex. (See Ref. [47I for a review of Feynman rules involving eikonal lines.) 

Exactly the same pattern applies no matter how many gluons attach to the quark line. This results in the general 
identity for Nj extra gluons shown graphically in Fig. [32l 

As desired, the sum of graphs with a soft gluon insertion is replaced by the graph with no soft gluon, and an over-all 
factor giving the expected eikonal line. 

If we repeat the above argument for multiple soft gluon insertions, we obtain the graph shown in Fig. 1331 Each eikonal 
line gives a propagator factor, l/(hj ■ nj) for j running from 1 to N s (From here on, the variables {hi, Ii2, ■ ■ ■ , /i/v s } 
will denote the collection of soft gluon momenta). Figure [33] can be re- written in a way that corresponds more directly 
with the Feynman rules of Wilson lines if we note that the product of eikonal propagators can be written as, 

nj ■ h Ns ^ n } [nj ■ hi] [rij ■ (hi + h 2 )] x • • • x [nj ■ (hi + h 2 + ■ ■ ■ + h Ns )] ' 

The summation sign means that we sum over all permutations of the momenta {hi, /12, ■ ■ ■ , ^tv s } in the denominator 
on the right side. That is, Fig. [33] is equivalent to summing all possible ways of attaching the soft gluons to a single 
eikonal line. To illustrate, we show this graphically in Fig. [31] for the simple case of two soft gluons. 
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FIG. 33: Result of applying the Ward identity with multiple soft gluon insertions 
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An analysis similar to the one given above was given by 
Landshoff and Polkinghorne [301 ] and actually predates 
QCD. 

By "dressed" gluon, we mean the outgoing gluon with the 
final state bubble which represents all possible radiative 
corrections. 



